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Rationale 



This publication, os the title implies, is about 
going beyond multiple-choice tests in order to 
assess student achievement. Why is this neces- 
sary? What has impelled the writing of this man- 
ual? To answer these questions, we first must 
take a look at how ideas about science teach- 
ing have changed. So, we begin this manual 
with a small history lesson in the development of 
science curricula. 

Brief History of Science 
Curricuium Deveiopment 

Welch (1979) characterized the social forces 
leading to science education reform of the 
1960s as scientists' concern about outdated 
curricular materials, science manpower short- 
ages, and the threat of Soviet technological 
supremacy. These forces set the stage for mas- 
sive federal support for science curriculum 
development. 

For approximately 20 years, the National 
Science Foundation supported extensive cur- 
riculum development and teacher inservice 
training programs in science education. Their 
curricula differed from old programs in its 
modernization of content, its emphasis on flex- 
ibility and variety in instructional tools, and 
the greater attention it gave to an overriding 
conceptual scheme, students' attitudes toward 
science, and the nature of scientific inquiry or 
hands-on student work. 

in spite of aii the support for curricuiar change 
over this period, there were aiso forces resistant 
to change, inciuding the foiiowing: 

• Many teachers were inadequateiy 
prepared in science and math, 
particuiariy at the eiementary and 
middie schooi ieveis, and were insecure 
about making curricuiar changes. 

• Concern in the 1970s focused more on 
speciai remediai ciasses, the basic skiiis, 
and mainstreaming than on science. 

Weich (1979), in summarizing the achievements 
of the curricuium reform of the 60s and 70s, 
reported: 

• Curricuiar aiternatives were deveioped 
and disseminated (PSSC— Physicai 
Science Study Committee; BSCC— 

Bioiogicai Sciences Curricuium Study; 

SCiS— Science Curricuium improvement 
Study). 



• Content was updated. 

• New curricuiar materiais emphasized 
science processes and hands-on work. 

• Science manpower needs were met. 

The reform of the 1990s and beyond differed 
from the eariier science curricuium reform in 
that it was a subset of a much iarger educa- 
tionai reform movement fueied by a concern 
that our students wouid not be internationaiiy 
competitive as aduits. Changes were proposed 
across the curricuium, emphasizing higher-order 
thinking skiiis and probiem-soiving. 

Science For All 

The emphasis on science education in previ- 
ous decades that resuited in the deveiopment 
of curricuium materiais provided a framework 
on which the 1990s' efforts buiit. Fiowever, the 
1990s differed from prior curricuiar reform move- 
ments in that they were geared toward scien- 
tific iiteracy for aii students (Nationai Research 
Councii, 1999), not just better science educa- 
tion for future scientists. Such iiteracy is criticai if 
the generai pubiic is to have a basis for making 
informed decisions about issues iike nuciear 
power, personai heaith, the environment, 
reproduction (Loucks-Fiorsiey, Brooks, Carison, 
Kuerbis, Marsh, Padiiia, Pratt, & Smith, 1990), 
and stem ceii research. 

Continuing this emphasis on science for aii 
students. Project 2061, a reform effort of the 
American Association for the Advancement of 
Science issued a 1989 report caiied Science for 
All Americans (www.project2061 .org/publications/ 
sfaa/online/sfaatoc.htm). This report suggested the 
knowiedge, skiiis, and attitudes that students 
shouid have as a resuit of their K-1 2 science 
instruction. The "science for aii" theme was 
aiso evident in the Nationai Science Education 
Standards (NSES), produced and distributed 
by the Nationai Research Councii in 1995 and 
avaiiabie oniine at www.nas.edu. in the NSES 
document, the Nationai Research Councii (1999, 
p. 2) states, "The intent of the Standards can be 
expressed in a singie phrase: Science standards 
for aii students. . .The Standards appiy to aii 
students, regardiess of age, gender, cuiturai 
or ethnic background, disabiiities, aspirations, 
or interest and motivation in science." This 
"science for aii" orientation has most recentiy 
been reflected in the Eiementary 
and Secondary Education Act of \/ 



2001, better known as the "No Child Left Behind 
Act." A re-emphasis on science testing as part of 
school accountability is also part and parcel of 
the "science for all" orientation nature of this Act. 

Science Inquiry 

"Science for all" is not the only theme emerg- 
ing in science education. One can also track 
the development of an emphasis on sci- 
ence inquiry. The National Science Teachers 
Association (Texley & Wild, 1997, p. 62) notes 
that the National Science Education Standards 
marks a move "away from presenting informa- 
tion to encouraging student discovery." Tobin, 
Kahle, and Fraser supported this move away 
from content presentation to a more inquiry- 
based approach. In Windows info Science 
Ciassrooms, they noted (1990, p.l51): 

If an instructional activity is to be 
consistent with the nature of science, 
it must engage students in attempt- 
ing to generate answers to questions, 
rather than merely illustrating what is 
pronounced by assertion to be true in 
a textbook. When laboratory activities 
or demonstrations are used to illus- 
trate the validity of what is known, the 
emphasis is placed disproportionately 
on what we think we know rather than 
on how we know it. In such situations, 
students are deprived of opportunities 
to think, predict, analyze, and discuss; 
that is, they are deprived of opportuni- 
ties to do science (emphasis added). 

The National Standards document also argues 
that students must do science (National Research 
Council, 1999, p. 2): 

“The Standards rest in the premise 
that science is an active process. 

Learning science is something that 
students do, not something that is 
done to them. ‘Hands-on’ activities, 
while essential, are not enough. 

Students must have ‘minds-on’ expe- 
riences as well.” 

This document goes on to note: "when engaging 
in inquiry, students describe objects and events, 
ask questions, construct explanations, test those 
explanations against current scientific knowledge, 
and communicate their ideas to others. They iden- 
tify their assumptions, use critical and logical think- 
ing, and consider alternative explanations. In this 
way, students actively develop their understand- 



ing of science by combining scientific knowledge 
with reasoning and thinking skills." 

In other words, "minds on" means that both 
students and their teachers need to pay atten- 
tion to the quality and sophistication of student 
thinking. For example, teachers may need to 
examine the quality of students' efforts to draw 
conclusions from data and determine what 
next instructional steps are needed to improve 
this particular thinking skill. 

Changing Assessment 
Practices 

Because of the changing emphases in science 
education, traditional assessment practices 
must also undergo a metamorphosis. The impe- 
tus for students to do science fuels an impetus 
for teachers to find new methods of assessment; 
methods that allow them to track student prog- 
ress toward the inquiry-based standards of sci- 
ence education that emphasize the quality of 
student thinking and student products. We are 
living in an era where the accumulation of facts 
is less important than the ability to manipulate 
or apply knowledge. Therefore, we can no lon- 
ger rely solely on multiple-choice, fact-based 
testing. We must develop and use assessment 
methods appropriate to our higher expecta- 
tions of students. This manual is intended to aid 
teachers in such development activities. It is 
written in response to the following statement, 
taken from Inside the Black Box by Black and 
Wiliam (1998, pp. 15-16): 

Teachers will not take up attractive 
sounding ideas, albeit based on 
extensive research, if these are pre- 
sented as general principles, which 
leave entirely to them the task of 
translating them into everyday prac- 
tice— their classroom lives are too busy 
and too fragile for this to be possible 
for all but an outstanding few. What 
they need is a variety of living exam- 
ples of implementation, by teachers 
with whom they can identify and from 
whom they can both derive convic- 
tion and confidence that they can do 
better, and see concrete examples of 
what doing better means in practice. 

This manual attempts to provide some "living" 
and "concrete" examples that will aid teachers 
in developing new assessment methods and 
encourages teachers to work together in doing 
so. The manual is particularly timely, in that 
assessment of science achievement is man- 
dated in the No Child Left Behind Act'. 



Current Viei/i/s on 

ASSESSaAENT 



Introduction 

Educational systems promote student growth in a variety of dimen- 
sions. Basic content knowledge can be effectively assessed with 
multiple-choice and completion tests. However educational reforms 
have become more concerned with higher-order cognitive dimen- 
sions (problem-solving, creativity), social dimensions (communication 
skills, ability to work in groups) and other dimensions (life-long learn- 
ing). While they are objective and efficient, traditional assessment 
measures may not serve these kinds of goals as well os other types of 
measures. Before we can choose on accurate, efficient method of 
assessment, we must clearly understand the goals of science instruc- 
tion. Do these goals encompass only the basic memorization of facts? 
If so, our traditional methods may be sufficient. If we wish to institute a 
science program that encourages dimensions that go beyond these 
basics, we will need to develop a repertoire of additional assessment 
methods. The organization of this manual is intended to aid teachers 
in developing expertise in identifying learning goals, choosing assess- 
ment methods, and communicating assessment results in such o way 
that student performance is enhanced. 






Identifying Learning Goais 

Let us begin by looking at the goals of science instruction. Only by 
clearly defining what we want students to know and be able to do 
can we then choose and plan effective assessments that accurately 
measure student achievement of these goals. 

According to McTighe and Wiggins (2004), the true issue being 
debated by assessment reformers is not whether some assessment 
methods are superior to others, but rather what is worth assessing, 
given limited assessment time. The debate about assessment, then, is 
o "value" debate. What goals or outcomes do we value for students? 
Kohn (1999, p. 216) expresses this "value" idea os "Content: Things 
Worth Knowing" and suggests that "a good deal of what students 
ore required to do in school is, to be blunt, not worth doing." By dis- 
cussing the relative value we place on particular goals or outcomes, 
assessment experts ore encouraging assessment reform. They direct 
curriculum developers and teachers to examine the curriculum itself 
and to ensure that goals of learning ore clearly expressed, relevant to 
students, frequently challenging, and properly assessed. 

If the goal is for students to learn basic facts and skills, 
then paper-and-pencil tests and quizzes generally provide 
adequate and efficient measures. However, when the goal 
is deep understanding, we rely on more complex perfor- 
mances to determine whether our goal has been reached. 
(McTighe & Wiggins, 2004, p. 141) 
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CHAPTEl? ONE 



“I think what’s 
going on is 
something 
more rodicoi 
than rethinking 
testing. What 
we’re reoiiy 
doing is 
rethinking our 
purposes.” 

(Wiggins, 1992, p. 37) 



It is important to remember that you are making ohoioes about 
assessment right now. These ohoioes may be constrained by what you 
have always done, what others think you should do, what you under- 
stand about assessment, or what you feel students expect you to do, 
but they are choices nonetheless. This manual is designed to provide 
you with the support you and other teachers at your school need to 
begin a process of defining the outcomes you value for students in sci- 
ence and developing assessment practices that encourage student 
progress toward desired ends. CHAPTER 2 provides background knowl- 
edge and practical advice to help you set instructional goals. 

This chapter also reminds us that how and what we test sends a 
clear message about what is valued. Traditionally, we have almost 
exclusively valued students' success at retaining and bringing forth 
a sample of the information they have internalized. When a teacher 
only emphasizes factual knowledge on tests, students conclude 
that remembering facts is the goal. When students are not given an 
opportunity to retest or improve their work, they may conclude that 
improvement is not valued. If higher-order thinking, problem-solving, 
and critical thinking are to be valued, then classroom assessments 
need to lend value to them. It is imperative for us to know our goals 
before choosing assessment methods. 

Choosing Assessment Methods 

Once instructional goals have been identified, assessment planning 
can begin. It is important to match the assessment to the learning 
goal to ensure that the assessment can accurately measure the goal. 
For example, let us suppose an instructional goal states, "Students 
will accurately, competently, and safely use scientific equipment." Is 
a multiple-choice test the best assessment method to choose for this 
goal? Wouldn't it be better to have students demonstrate fhe use of 
scientific equipment (Bunsen burners, microscopes, wave tables) if we 
wish to ascertain their competency for this task? Such demonstrations 
are often termed "student performances," and assessment meth- 
ods used to judge them can be called "performance assessments." 
According to Wiggins (1989), such assessments require the following: 

Tests should involve real-life tasks, performances, or challenges 
that replicate the problems faced by a scientist, historian, or 
expert in a particular field; thus, they are complex tasks rather 
than drills, worksheets, or isolated questions. 

Students should understand up-front the criteria on which 
their work will be judged and be able to apply the criteria to 
their work. 

Students should be asked to demonstrate their control over 
the essential knowledge being taught by actually using the 
information in a way that reveals their level of understanding. 

Others argue that performance assessments should 

Require students to perform tasks that include the highest skill 
levels of problem finding and solving to include role-playing, 
"real-life" simulations, investigation, major projects, and creative 
depictions. (Wiggins, 1992; Glatthorn, 1998) 

Use power verbs (such as research, analyze, evaluate, and 
depict) to reinforce that the student is demonstrating what he or 
she can do with information. (National Research Council, 1999) 



• Where appropriate, allow students to be involved in creating 
the criteria against which their performance will be judged. 
(Stiggins, 2001) 

• Include audiences in addition to the teacher to validate and 
judge student performances (e.g., scientists, other students). 

(Kohn, 1999) 

Kohn also introduces the need for assessment to be reolity-bosed; 
that is, based upon work that is done in the "real" (as opposed to the 
educational) world. Kohn expresses this os: "Thus, our question is not 
merely. What's the task? But, How does the task connect to the world 
that the students actually inhabit?" And he reminds us: "Children ore 
people who have lives and interests outside of school, who walk into 
the classroom with their own perspectives, points of view, ways of 
making sense of things and formulating meaning. What we teach and 
how we teach must take account of these realities" (1999, p. 219). 

How do we infuse our teaching and our assessments with "reality"? 
What is a "real-world" task? A few examples of generic kinds of 
tasks that have students using or applying information in ways that 
go beyond just recalling or recognizing correct information include 
the following: 

• Leading a group to closure on an issue 

• Collecting, analyzing, and interpreting data about the 
success of o program, product, or event 

• Researching both sides of o controversy and reporting it 
objectively 

• Developing criteria for rating the quality of o product, 
proposal, or recommendation 

Such tasks are recognizable os port of many adult work environments 
and con be infused into the work demanded of students. For on aca- 
demic, science-related example of an assessment involving a real- 
world task, see FIGURE 1.1. 

CHAPTER 3 of this manual builds upon the idea of implementing per- 
formance-, authentic- and reality-based assessment by examining 
several different assessment methods that go beyond multiple- 
choice testing. These assessment methods include those found in 
FIGURE 1.2. Chapter 3 attempts to provide clear definitions of different 
assessment types and then suggests ways each type could be used 
in the science classroom. 

Why do teachers need such a diverse toolbox of assessment meth- 
ods that go beyond multiple-choice testing? To answer this question, 
let's first examine o typical, traditional classroom scenario. 

SCENARIO: The teacher teaches a unit on soil formation and 
then gives o unit test with multiple-choice, short-answer, 
and matching items to assess students’ retention of the 
information. Students ore told about the test one week in 
advance, and they bring no resource materials with them 
to the test. Students’ tests ore scored and returned and form 
the basis of the six weeks’ grade. 

Proponents of assessment reform argue that past assessment prac- 
tices (os the ones depicted in the above scenario) ore inadequate. 



Glatthorn (1998, p. 8) characterizes such scenarios as "teaching to 
the test" and offers this ciassroom iiiustration: 

Students will have to take a short-answer objective test 
assessing their knowledge of the legislative process os 
employed in their state. A typical question asks students 
to define bill and law. The specific content of the test is 
confidential, with the test administered under conditions of 
high security. The teacher has identified the questions the 
test is likely to ask by reviewing previous editions of the test. 

The teacher prepares practice material on test-like items. 
Students spend most of their class time completing the 
practice exercises and checking their answers. 

It is not that objective, fact-based tests ore not important. As we 
stated at the beginning of this chapter, such tests are effective and 
efficient means of measuring basic knowledge. However, such fact- 
based exams should not be the only type of assessment method 
used by the teacher. 

Fundamental problems with such fact-based, traditional assessment 
practices include: 

• Narrowness of scope 

• Limited expectations of students 

• Overemphasis on memorizing isolated facts, rather than 
concentrating on connections and relationships 

• Lack of student ownership in the learning process 

• Lack of incentives for student improvement in their work 

CHAPTER 3 is included in this manual to give teachers expanded alter- 
natives to traditional assessment practices. 

FIGURE 1.1 

Sample Assessment Utilizing a Reai-Life Task 

ASSIGNMENT: 

Research with your team the value and uses of whales across time and cul- 
tures. Analyze and evaluate the practical uses vs. environmental protection 
issues, and develop support for both. Choose a position and be prepared to 
justify and present your position to the doss in o convincing manner. 

ASSESSMENT METHODS: 

1. Research quality will be assessed through teacher observation of 
teamwork and teacher review of a team journal of completed group 
work. 

Teams ore not allowed to proceed with developing their presentations until they 
con show they hove adequately researched the topic. 

2. Oral presentation skills will be assessed by peers and teachers using 
0 rubric. 

Source: Adapted from High Success Network training materials. Outcome- 
Based Education Summer Conference, Charloffe, NO, 1992: High Success 
Nefwork, P.O. Box 1630, Eagle, CO 81631. 
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FIGURE 1.2 

Assessment Methods 



Observe students using 

• Informal observations 

• Structured observations 

Soliciting information from students through 

• Interviews 

• Self-assessment questionnaires 

Evaluate students' work using 

• Open-ended questions 

• Performance tasks 

• Journals 

• Exhibitions and culminating demonstrations (I.e., science fair projects) 

• Portfolios 

V / 

Communicating Assessment Resuits 

After Instructional goals ore set and assessments ore performed, 
teachers need to communicate assessment findings to students and 
to parents. CHAPTER 4 provides on overview of assessment Instruments 
and grading schemes that con provide timely and essential feed- 
back to learners. The chapter begins with on overview of the types of 
rubrics (scoring guides) available to teachers and the effectiveness 
of each type. The chapter then briefly discusses grades and grading 
os a way to communicate student progress toward learning goals. 

Using This Manuai 

As you read this publication, the authors hope you will: 

• Consider the variety of possible student outcomes In science, 
and select those that are most Important for students. 

• Reflect on and choose appropriate ways to assess student 
performance for Important outcomes. 

• Develop appropriate criteria for judging student work, 
and consider the alternatives to the teacher as sole judge 
of student work (I.e., using peers, professionals from the 
community, and student self-assessment). 

• Reflect on grading practices and how Information from a 
variety of assessment methods might be Incorporated Into a 
composite picture of achievement. 

• Consider ways to get yourself and your school started In 
analyzing current practices. 

This publication Is not Intended as o text but os a self-study resource. 
We hope you will Interact with It, respond to the questions posed, and 
use the manual os an opportunity to reflect on your assessment prac- 
tices. We suggest that you work through the manual with at least one 
other teacher. If possible, because of the valuable sharing of Ideas 
that will result. 



Final Notes 



A key point to remember as you go through this manuai is that the 
way we assess our students speaks voiumes about what we vaiue in 
education, if throughout 12 years of schooi, students are assessed 
oniy on passive, non-creative work (worksheets, muitipie-choice 
tests), how iikeiy is it that they wiii become probiem-soivers, creative 
producers, effective communicators, and seif-directed iearners as 
aduits? By going beyond muitipie-choice testing, we hope to foster 
these quaiities in our students. 



APPMCATION 



The first step in changing science education assessment is to 
have a clear understanding of your current practices. Please 
answer the following questions and discuss them with another 
teacher. 



Self-Assessment Questionnaire 

1 . List below, in your own terms, the four most important student 
outcomes that resulted from your science instruction last year. 
That is, what could students do well at the end of the year that 
they could not do well at the beginning of your instruction? 



2. Which of the following kinds of work did you require of 
students? 

□ Listen to lectures 

□ Take tests on text/lectures 

□ Take end-ot-chopter tests 

□ Design experiments 

□ Read textbooks 

□ Talk with scientists 

□ Solve problems in a team setting 

□ Maintain journals ot data collected 

□ Do hands-on investigations 

□ Make presentations to the class 

□ Other 

3. In your science classes, on a typical day, how often were 
most students engaged and challenged by their work? 

□ All the time 

□ Very often (more than halt the time) 

□ Often (about halt the time) 

□ Somewhat often (less than halt the time) 

□ Almost never 

4. Think about the assessment methods represented by the 
grades in your grade book. What might your grade book say to 
students about what you value in science education? 






Desired Student 

OUTCOAAES IN 
SCIENCE; 



What Do We Want Students To Be Able To Do? 

Educational goals provide the framework for assessing student prog- 
ress. The goals a teacher has for his or her class have clear implica- 
tions for assessment. Without a clear vision or articulation of what is 
to be accomplished in the time you have with your students, how 
do you know what to assess? Outlining your goals before beginning 
instruction is very important. 

The Notional Science Education Standards publication (1999, p. ix) 
written by the Notional Research Council begins with this goal state- 
ment: "This nation has established os a goal that oil students should 
achieve scientific literacy." This booklet goes on to describe such lit- 
eracy os "Scientific literacy enables people to use scientific principles 
and processes in making personal decisions and to participate in 
discussions of scientific issues that affect society" (1999, p. ix). With this 
description, the National Research Council begins to break its overall 
goal (scientific literacy) into smaller component parts, os depicted in 
FIGURE 2 . 1 . 



FIGURE 2.1 
Scientific Literacy 
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Clearly, the Notional Research Council is emphasizing that students 
need to learn to think like scientists (use scientific processes) as well 
as learn science concepts (knowledge of scientific principles). To 
achieve the goal of scientific literacy, the Council (1999, p. 104) has 
written content standards (statements that elucidate what students 
should know and be able to do) within eight different categories: 

Unifying concepts and processes in science 
Science os inquiry 



• Physical science 

• Life science 

• Earth and space science 

• Science and technology 

• Science in personal and social perspectives 

• History and nature of science 

In this organizational scheme of content standards written within 
the 8 different categories, all of which support the twin goals of 
scientific literacy (understanding of science concepts and science 
processes), the National Research Council has refocused science 
instruction on new facets. This change is summarized in chart form 
(National Research Council, 1999, p. 113) and reproduced here in 
FIGURE 2.2. Note particularly the thinking processes and processes 
related to scientific inquiry that hove added emphasis in the newer 
science curriculum. 



FIGURE 2.2 

Changes in Emphasis in Science Instruction 
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Knowing scientific facts and information 


Understanding scientific concepts and developing 
abilities of inquiry 


Studying subject matter discipiines (physicai, 
iife, earth sciences) for their own sake 


Learning subject matter disciplines in the context of 
inquiry, technology, science in personal and social 
perspective, and history and nature of science 


Separating science knowiedge and science 
process 


Integrating all aspects of science content 


Covering many science topics 


Studying a few fundamental science concepts 


implementing inquiry as a set of processes 


Implementing inquiry as instructional strategies, 
abilities, and ideas to be learned 



Source: National Research Council. (1999). National science education standards. 
Washington, DC: National Academy Press, p. 1 13. 



The Notional Science Education Standards' stated goal (scientific liter- 
acy), the eight categories in which standards ore clustered, and even 
the statements found in the "More Emphasis On" column of FIGURE 
2.2, all provide only very general guidelines for teachers. To further 
elucidate what students should know and be able to do, the Notional 
Research Council also provides content standards within the eight 
stated categories. In the next section of this chapter, we will examine 
o sample content standard for grades 9-12 and attempt to interpret or 
unpoc/c this standard to find the specific learning goals for students. 

Unpacking the Content Standards 

There is o wealth of science knowledge and scientific abilities/pro- 
cesses that could be taught to students. In fact, the overabundance 
of teaching possibilities can be overwhelming for teachers, who won- 
der where to begin and how deep to go. Teachers often feel that 
they must "cover" everything in the textbook. The National Science 
Education Standards provide one means of managing the task of 
identifying the essential science information for students. These stan- 





dards distill the amount of information into o smaller subset of essen- 
tial information. However, even these standards are not totally trans- 
parent; it will still take some expertise to understand exactly what they 
are trying to convey. In this publication, we will unpack the National 
Standards using Bloom's Taxonomy os our framework. 

Bloom's Taxonomy (Bloom, 1956) grouped educational objectives 
into six distinct, hierarchical categories: Recall, Comprehension, 
Application, Analysis, Synthesis, and Evaluation. In practice, the Recall 
and Comprehension categories gradually come to be grouped 
together as "Knowledge." Throughout the years, many teachers 
hove used this taxonomy to ensure that varied levels of thinking were 
encouraged in their classrooms. Here, we use the levels of Bloom's 
Taxonomy to unpack the meaning of the Notional Standards. 

Let's begin with Content Standard A under Science os Inquiry in 
Grades 9-12. This standard states: 

CONTENT STANDARD A: 

As o result of activities 
in grades 9-12, oil stu- 
dents should develop: 

• Abilities necessary 
to do scientific 
inquiry 

® Understandings 
about scientific 
inquiry (Notional 
Research Council, 

1999, p. 173). 

This standard clearly 
relates to the overall 
goal of the National 
Science Education 
Standards— to promote 
scientific literacy. Note 
that it emphasizes 
both science knowl- 
edge and scientific 
processes. It appears 
closely tied to the 
"Ability to use scientific 
processes" box shown 
in FIGURE 2.1. However, 
this is still o very gen- 
eral statement. It does 
not provide sufficient 
specificity for teach- 
ers to understand pre- 
cisely what students 
will need to know or be 
able to do. 

In order to further 
elucidate the meaning 
of Content Standard A, 

the Notional Research Council provided a section titled, "Guide to 
the Content Standard." In this section, they include the following six 
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underlying abilities and concepts related to the "abilities necessary 
to do scientific inquiry" statement: 

1 . Identify questions and concepts that guide scientific 
investigations 

2. Design and conduct scientific investigations 

3. Use technology and mathematics to improve investigations 
and communications 

4. Formulate and revise scientific explanations and models 
using logic and evidence 

5. Recognize and analyze alternative explanations and models 

6. Communicate and defend o scientific argument (National 
Research Council, 1999, pp. 175-176). 

We will call each of the above a benchmark that helps explain 
the meaning of the standard. In this next section, we shall attempt 
to unpack two of these benchmarks even further, using Bloom's 
Taxonomy. 

1. Identify questions and concepts that guide scientific 
investigations. 

Which levels of Bloom's Taxonomy are implied in this state- 
ment? The verbs that begin each benchmark often provide 
clues to the levels of Bloom's. For example, "identify" in the first 
benchmark sometimes correlates with lower-level thinking: the 
Knowledge level. Students ore asked to identify concepts that 
guide investigations, not apply them or analyze them. Other 
verbs that would signal the Knowledge category may include: 
define, list tell, label, match, select choose, name, spell, etc. 
Returning to the benchmark, we find that certainly, students 
will need basic knowledge of scientific investigations. They will 
need to know, for example, that investigations contain certain 
parts as problem-finding, hypothesizing, designing on experi- 
ment, controlling variables, reporting conclusions, etc. They 
will need to understand each of these separate processes, 
view examples of each, and distinguish between high-qual- 
ity and low-quality processes. Students will also need back- 
ground knowledge before they con begin to create their 
own investigations. They will need to find out what is already 
known, what scientific concepts may govern their investi- 
gation, and what safety concerns should be considered. 
Therefore, this benchmark implies that students must hove 
opportunities to gain such Knowledge. 

This is not simply a Knowledge-based benchmark. Students 
ore not asked to learn "about" scientific investigations, buffo 
actually perform one port of them. This means that students 
will also need to learn scientific processes, including ways to 
think like o scientist. This is evident in that students ore asked 
to "identify questions." Flere, we understand that students will 
need to formulate their own scientific questions (problem-find- 
ing) and write testable hypotheses for these questions. These 
activities move beyond the Knowledge level of Bloom's and 
into the Application and Synthesis areas. Flere, students must 
gather what information they have been taught about scien- 
tific investigations and then use this knowledge to construct 
high-quality hypotheses. 



Based upon our unpacking activity so far, what wouid we 
expect to see in the science ciassroom? Certainiy, we wouid 
expect some introduction of vocobuiory terms with accom- 
panying exercises and perhaps some textbook readings on 
vocobuiory terms to show how these terms fit into o scientific 
investigation. The teacher might oiso introduce some outside 
reading, describing o scientific investigation that ied to the 
invention of o usefui everyday object (Veicro, Post-it Notes, 
giue) or the story of o historic scientific investigation (Waiter 
Reed and the cure for yeiiow fever; Fieming and the discovery 
of peniciiiin). 

The teacher couid highiight common investigotionoi steps 
used in the discoveries or inventions and heip students oppiy 
the steps identified to these investigations. A video ciip from 
the movie The Medicine Man might stimuiote discussion of 
the importance of controiiing voriobies. (The doctor in the 
movie seems to hove found o cure for cancer using o tropicoi 
piont. He cannot reproduce the resuits, primoriiy because his 
originoi botch oiso contained ground-up insects that infested 
the tropicoi flower. The insects were the active ingredient in 
the cure, not the flower.) The teacher might stimuiote student 
thinking by having them work in smoii groups to propose prob- 
iems needing scientific investigations. From the cioss set of 
probiems, the students couid then work in their smoii groups to 
write testobie hypotheses for these probiems. 
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In order to check the proficiency of students relative to this 
learning target, teachers might implement vocabulary quiz- 
zes, comprehension questions on readings, o short essay 
describing common steps found in several science investiga- 
tions from the outside reading, and o rubric describing the 
qualities of o testable hypothesis that students could then use 
to assess their own hypotheses and those of peers. 

2. Design and conduct scientific investigations 

In this second benchmark, we find two new verbs: design 
and conduct. What levels of Bloom's Taxonomy do these 
verbs imply? In order to design and conduct on investigation, 
students will definitely hove to App/y the Knowiedge they 




have learned about scientific investigations. In the design 
process, students must Analyze the problem in order to iden- 
tify needed components and/or equipment. Synthesize infor- 
mation from multiple sources to help them choose the proce- 
dure, and Evaluate alternatives to choose the best method 
of investigation. In this benchmark, then, students will work at 
all levels of Bloom's Taxonomy. Here, again, the verbs at the 
beginning of the benchmark help signal the level of think- 
ing required from the students. FIGURE 2.3 may be useful in 
unpacking standards and benchmarks os it contains samples 
of these "signaling" verbs. 



FIGURE 2.3 

Verbs Signaling Cognitive Levels 



COGNITIVE LEVEL IN 
BLOOM’S TAXONOMY 


SIGNALING VERBS 


Knowledge 


identify, define, iist, teii, iabei, match, seiect, choose, name, speii 


Application 


identify, moke use of, pion, organize, deveiop, utiiize, oppiy, try 


Analysis 


compare, dissect, inspect, categorize, contrast, simpiify, distinguish, 
ciossify, examine, conciude 


Synthesis 


buiid, compiie, invent, formuiote, compose, construct, originate, 
change, adopt, soive, predict, moke up, improve 


Evaluation 


criticize, judge, recommend, support, argue, justify, dispute, 
appraise, prioritize, assess, voiue, defend 



What might this second benchmark look like in the science class- 
room? Previously, students may hove identified problems and writ- 
ten hypotheses. From the class presentations of these problems and 
hypotheses, groups of students may choose one such problem, with 
its accompanying hypothesis and develop an investigation to prove 
or disprove this hypothesis. Alternatively, the teacher may propose 
o problem to the class and ask students to design on experiment to 
answer the question. A sample question might be: How can we deter- 
mine the background level of radiation present in this classroom? The 
teacher could provide o graphic organizer that would require certain 
information (safety precautions, independent and dependent vari- 
ables, equipment list, procedural steps, etc.). 

In this manner, several groups may write proposed investigational 
procedures for the some problem. Each group could then present 
its proposal to the class. The class would Analyze all the proposals 
and decide which one would best answer the proposed scientific 
question. They would need to be prepared to justify their argument 
for one proposal over another (Evaluation). Finally, once a particu- 
lar procedure was chosen, each group could actually conduct the 
experiment and report its results. 

To check student proficiency relative to this benchmark, the teacher 
might use o "Scientific Investigation" rubric, an "Oral Presentation" 
rubric, and o short essay requiring students to justify their choice of 
the best procedure. 





In the discussion of the two benchmarks, we have seen that Content 
Standard A encourages thinking at all levels of Bloom's Taxonomy. It is 
important that teachers take the time to dissect or unpack standards 
(whether national or state standards) to ascertain the levels of think- 
ing required in each. Many times, a standard at first glance appears 
to be a Knowledge level standard but can have higher-order thinking 
skills embedded within it. It is also important for teachers to ask, "How 
will this look in my classroom?" as they read through standards. Such 
visualizations can aid teachers in planning high-quality lessons that 
will actually help students meet the standards. 



APPMCATION 



Take one of the remaining benchmarks iisted beiow and unpack it 
for ieveis of thinking using Bioom's Taxonomy. Then, write o brief 
description of how this benchmark might be addressed in your 
ciossroom. 

• Use technology and mathematics to improve investigations 
and communications. 

• Formulate and revise scientific explanations and models using 
logic and evidence. 

• Recognize and analyze alternative explanations and models. 

• Communicate and defend o scientific argument. 
) 

Another Source for Student Learning 
Goals in Science 

So far in this chapter, we have only examined one student goal of 
learning— that of scientific literacy promoted by the National Research 
Council. We have seen that this Council created standards to promote 
scientific literacy within eight different categories. Before we leave 
this discussion of "What Do We Want Student To Be Able To Do?" we 
should first examine some goals from another source. This new source is 
the National Assessment of Educational Progress (NAEP). 

In the Framework for the 2005 NAEP Science Assessment, we find the 
matrix displayed in FIGURE 2.4. 





FIGURE 2.4 

NAEP Science Assessment Framework Matrix 
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Source: Science 
Framework for the 2005 
National Assessment of 
Educational Progress. 
Retrieved 8. 14.05 from 
www.nagb.org/pubs/ 
s framework 05/ch2.html. 



NATURE OF SCIENCE 



Themes 

Systems, Models, Patterns of Change 



Under the separate headings of these domains, the foiiowing student 
expectations are ciustered: 

Conceptual Understanding 



1. Organize important science ideas and express them in their 
own words. 

2. Demonstrate the acquisition of o meaningfui knowiedge 
base. 

3. Successfuiiy exchange ideas and information with other 
students. 

4. Read, comprehend, discuss, and evaiuate information in 
science articies. 

5. Generate, research, and report on questions of interest. 

Scientific Investigations 



1 . Demonstrate the use of science process skiiis (ciassifying, 
deveioping a research question, making predictions, 
coiiecting, onaiyzing, and interpreting data). 

2. Demonstrate the use of ioborotory skiiis. 

3. Generate o hypothesis and design on experiment to test that 
hypothesis. 

4. Determine if measurements ore reiiobie and vaiid. 

5. Moke judgments about the adequacy of evidence 
supporting a hypothesis. 

6. Deveiop oiternotive interpretations and iook at data in more 
than one way. 





Practical Reasoning 

1. Work successfully through a complex problem with a group of 
other students. 

2. Think abstractly and consider hypothetical experiences. 

3. Consider several factors simultaneously. 

4. Take a depersonalized view. 

Nature of Science and Technology 

1 . Identify and summarize examples of how explanations of 
scientific phenomena have changed overtime as new 
evidence emerged. 

2. Demonstrate an understanding of the difference between 
correlation and causality. 

3. Discuss the Interaction of scientific knowledge and values as 
they relate to problems we face. 

4. Summarize the review role of scientific organizations In 
avoiding bias and maintaining quality In published research. 

5. Understand that scientific conclusions are based on logic 
and evidence, but no fixed set of steps makes up a scientific 
method. 

6. Explore the advantages and disadvantages Involved In the 
design and development of technologies. 

7. Summarize examples of how scientific knowledge has been 
applied to the design of technologies. 

8. Understand that models of objects and events In nature can 
be used to understand complex or abstract phenomena. 

9. Understand that systems are often artificial constructs used 
by people to gain a better understanding of a complex Idea 
and that a system construct entails Identifying and defining 
Its boundaries. Identifying Its component parts and the 
Interrelations and Interconnections among those parts, and 
Identifying the Inputs and outputs of the system. 

10. Recognize patterns of similarity and difference, to perceive 
how these patterns change over time, to remember common 
types of patterns, and to transfer their understanding of a 
familiar pattern of change to a new and unfamiliar situation. 

In the NAEP student expectations, we see the same trend (going 
from general statements to more specific ones) that was evident In 
the National Science Education Standards (NSES). We also find that 
several of the categories seem to overlap, as shown In FIGURE 2.5. Both 
sources emphasize learning science processes as well as science 
concepts. Thus, the two different sources appear to have similar 
Ideas about what science students should know and be able to do 
as a result of activities within science classes. 



APPMCATION 



How do your own student expectations compare to those 
of NSES and NAEP? At this point, pieose take a few minutes 
to reflect on what you feei ore important student 
expectations for science instruction. You may wish to 
discuss your responses with other science teachers. The 
foiiowing questions may aid in this discussion: 

• How wouid you rank order the NSES student 
expectations? The NAEP student expectations? Why 
would you choose this order? 

• Do you feel that the NSES and NAEP expectations cover 
oil the essential science content for your course? If not, 
what expectations would you odd to the list? Are there 
any you would delete or revise? 

In the space below, describe for your students and their 
parents the top 4 course outcomes you expect for the year: 

1 . 

2 . 

3. 

4. 

V / 

FIGURE 2.5 

Comparing Student Expectation Categories from Nationai Science 
Education Standards and NAEP 
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One Final Source of Student 
Learning Goals in Science 

The most important source of student learning outcomes or goals 
that teachers should access is the state curriculum. Such state cur- 
ricula are usually organized by grade levels in grades K-8 or by sci- 
ence disciplines (Biology, Chemistry, Earth Science, etc.) in grades 
9-12. Like the National Science Education Standards, your own state 
curriculum may begin with broad goals that are then dissected into 
smaller and smaller component parts. For example, in Florida, the 
state science curriculum begins with the subject area (science), 
then breaks this into strands (The Nature of Matter, Energy, Force and 
Motion, Processes That Shape the Earth, Earth and Space, Processes 
of Life, and How Living Things Interact With Their Environment). Under 
each of these strands, a number of standards further explain what 
students should know and be able to do. The standards are then 
supported by benchmarks, which occur as the most specific level of 
the curricular hierarchy. 

Such specificity can be very helpful to teachers who are trying to 
unpack written statements about what students should know and be 
able to do upon completion of their courses. To ensure that all essen- 
tial content (science concepts, processes, and skills) encompassed in 
state curriculum is actually taught, teachers may find that designing 
a planning matrix similar to the one in FIGURE 2.6 may be helpful. Here, 
the names of the major units constructed by the teacher are written 
across the top and standards from the state curriculum are written 
vertically on the left. The teacher can place checkmarks within the 
units where particular standards will be addressed. In this manner, the 
teacher can clearly map out a course of study that encompasses 
all student learning targets. Standards related to thinking skills or to 
understanding scientific processes (as "Use scientific processes to 
solve problems" and "Science, technology, and society are interde- 
pendent") can be explored in all units, whereas teaching of basic 
science concepts can be focused within particular units. Of course, 
the next steps for the teacher (as explained previously) are then to: 

1 . Visualize the actual activities that must occur in the 
classroom for students to achieve these targets. 

2. Plan student assessments that will measure achievement of 
these targets. 

Planning appropriate assessments for learning targets is the subject of 
our next chapter. 



FIGURE 2.6 

Planning Matrix for Learning Targets 





MAJOR UNITS 


STANDARDS 


ENERGY 

SOURCES 


EARTHQUAKES 
& VOLCANOES 


SPACE/SOLAR 

SYSTEM 


ENVIRONMENTAL 

ISSUES 


WEATHER 


LIVING 

THINGS 


Use scientific 
processes to 
soive probiems 


X 


X 


X 


X 


X 


X 


Aii matter has 
observabie, 
measurabie 
properties 






X 






X 


Basic principies 
of atomic 
theory 






X 






X 


Energy may 
be changed in 
form 


X 












interactions 
of matter & 
energy 


X 












Types of 
motion may 
be described, 
predicted, 
measured 




X 






X 




Types of forces 
and their 
effects on an 
object 




X 


X 




X 




Processes in 
the iithosphere, 
hydrosphere, 
and 

atmosphere 
interact to 
shape Earth 








X 


X 




Need for 
protection of 
naturai Earth 
systems 








X 






interaction and 
organization 
of Soiar System 
with Earth 






X 








Vastness of 
universe 






X 











MAJOR UNITS 


STANDARDS 


ENERGY 

SOURCES 


EARTHQUAKES 
& VOLCANOES 


SPACE/SOLAR 

SYSTEM 


ENVIRONMENTAL 

ISSUES 


WEATHER 


LIVING 

THINGS 


Structure and 
function of 
living things 












X 


Process and 
importance 
of genetic 
diversity 












X 


Interdependent 
nature of living 
things 








X 




X 


Consequences 
of using 
limited natural 
resources 








X 






Natural events 
occur in 
patterns 




X 


X 




X 




Science, 
technology, 
and society are 
interdependent 


X 


X 


X 


X 


X 


X 



Source of standards: Florida Department of Education Sunshine State Standards, 
Grades 6-8. Retrieved 8. 15.05 from www.firn.edu/doe/curric/prek12/pdf/science6.pdf. 



APPLICATION 



Construct a matrix similar to the one found in FIGURE 2.6 using 
standards from your state curriculum. Answer the following 
questions: 

1 . Are there standards that occur in EVERY unit you teach? 
Why or why not? 

2. Must you teach a separate “nature of science” unit, 
or con standards related to this be incorporated into 
existing units? 

3. How does constructing the matrix either increase or 
decrease your ability to see connections among 
science concepts? How might this affect your teaching? 
Your students’ learning? 

4. How comparable is your state curriculum to the NSES? 

To NAEP student expectations? 

5. Do you feel that your state curriculum addresses oil the 
essential scientific knowledge for your class/grade level? 
If not, what should you do? 

V 








PERfORAAANCE- 

Based Assess/iaent; 

Observing Students ’ 

1 

( 

Assessment is an integral part of the teaching and learning process 
since the main goal of education is to produce or facilitate change 
in learners. How do we know if such a change is occurring? How do 
we know if students ore becoming competent and knowledgeable? ' 
To obtain this information, we must select assessments appropriate to 
the desired student outcomes. l’,| 

In CHAPTER 1, we listed several types of assessment methods available |( 

to teachers (see FIGURE 1.2), and in CHAPTER 2, we viewed classroom 
snapshots of how particular methods (essays, quizzes, oral presento- 'I 

tions, demonstrations) might be used to judge student performance ■ 

relative to particular benchmarks. In this chapter, we will elaborate 
on these methods that teachers con utilize to determine what stu- 
dents know or are able to do and particularly emphasize those that ; 

go beyond multiple-choice testing. In this chapter, we define perfor- 1 

mance-based assessment. J 



Performance-Based Assessment 



If o teacher is interested in what students understand about types of 
rocks, she may choose to create multiple-choice questions to obtain 
this knowledge. On o multiple-choice test following the rock unit, 
these questions might appear: 

1 . The three classifications of rock ore: 

o) conglomerate, metomorphic, and obsidian 

b) igneous, metomorphic, and sedimentary 

c) quartzite, igneous, and conglomerate 

d) gneiss, schist, and sandstone 

2. On the Mohs scale, apatite is ranked os a 5. Which of the 
following rocks would scratch apatite, but not be scratched 
by it? 

o) talc with o ranking of 1 and colcite with a ranking of 3 

b) quartz with o ranking of 7 

c) diamond 

d) a and b 

e) a and c 

f) bondc 

These ore legitimate Knowledge-based questions that assess if stu- 
dents know that igneous, metamorphic, and sedimentary are the 
three types of rocks and that rocks with high numbers on the Mohs 
scale will scratch (but not be scratched by) rocks with low numbers. 
Therefore, if the learning goal is o Knowledge-based one, then these 
questions will certainly reveal whether students possess this knowledge. 




CHAPTEI^ THREE 



Let us suppose that our state curriculum mirrors the National Science 
Education Standards, Content Standard D in Earth and Space 
Science for Grades 9-12. This standard states: 

As a result of their activities in grades 9-12, all students 

should develop an understanding of 

• Energy in the earth system 

• Geochemical cycles 

• Origin and evolution of the earth system 

• Origin and evolution of the universe 

The key term in this standard is "understanding." What does it mean 
for students to understand fhe "origin and evolution of the earth 
system"? As we did in CHAPTER 2, we must first unpoc/c this standard 
to determine the essential science information students will need 
to know. For example, this standard implies that Earth has changed 
since its origin— it is not the same today as it was originally. What 
then, have been the causal agents of these changes that students 
may need to know? The following concepts may occur to the experi- 
enced Earth Science teacher: 

Uniformitarianism Unconformity 

Principle of Superposition Earthquakes 

Principle of Original Horizontality Erosion 

Folding and Faulted Layers Volcanoes 

Principle of Cross-Cutting Relationships Deposition 

We can define each of these terms for students, provide physical 
and virtual demonstrations (as using soft clay layers to model folding, 
mountain building and faulting, OR provide students with a link to a 
website that uses a multimedia presentation to demonstrate these), 
ask students to create 2-D and 3-D illustrations/models of the princi- 
ples, provide textbook readings as well as readings from other source 
materials, and show students photographs of geologic sites while 
explaining which geologic forces/principles caused unique forma- 
tions. How will we know, however, that students truly understand how 
the Earth changes? Are multiple-choice questions enough? 

To demonstrate understanding, we want students to do more than 
just recognize or recall the right answer to a question. We want to 
stimulate their higher-order thinking skills, as application, analysis, 
synthesis, or evaluation. One way to do this is to use a performance- 
based assessment. Some examples of performance-based assess- 
ments related to the higher-order thinking skills for this standard 
might include: 

Application and Analysis. The teacher has already shown the 
students photographs of geologic sites and explained how these 
might have formed. Provide students with a new photograph 
(one the teacher did NOT explain) and ask them to write a plau- 
sible explanation of how this site formed, using at least three of 
the vocabulary terms. 

Synthesis. Ask students to create a five-step sequence of geo- 
logic events, using at least three of the vocabulary terms. Then, 
have them write a brief description explaining each step and 
provide an illustration of the landform at each step. 



Evaluation. Ask students to support or refute this statement: The 
Mississippi River Deita and Deiicate Arch in Utah were formed by 
simiiar processes. 

All three of the above performance-based assessment examples pro- 
vide students with an opportunity to demonstrate what they know, 
rather than just regurgitating a definition or recalling isolated bits of 
information. In FIGURE 1.2 of CHAPTER 1, we listed three main categories 
of performance-based assessment methods: 

1. Observing students using informal observations and 
structured observations 

2. Soliciting information from students via interviews or self- 
assessment questionnaires 

3. Evaluating student work using open-ended questions, 
performance tasks, journals, exhibitions, and portfolios 

The three geologic change examples would all fall in the "evaluating 
student work" category, as all are open-ended questions (questions 
that require students to construct a response, rather than choose a 
response from a list of possible answers). In the next few chapters, 
we will examine each category of performance-based assessment 
in more detail and provide suggestions for implementing them in 
the science classroom. In this chapter, we focus on the first category 
—observing students. 

Observing Students 

Teachers constantly observe students. They see that Juan is arguing 
with his group, Serena looks confused about a new concept, Nikita 
is daydreaming, Chavez is working hard, etc. Such observations are 
informai in nature and may serve both assessment and classroom 
management functions. Teachers also make more formal observa- 
tions of students, as when they use an observation instrument to col- 
lect data about student performance. These formal observations are 
classified as sfrucfured observations. Some goals or objectives can 
only be assessed by such structured observations. For example, it is 
difficult to imagine how a teacher would assess students' team prob- 
lem-solving skills or success at independent lab work without observ- 
ing them. 



Informal Observations 

With informal observations, teachers are actively observing the 
students, but no particular group or individual is the target of the 
observation. Similarly, this type of observation generally occurs spon- 
taneously and may not have a predetermined focus. However, infor- 
mal observations can be very useful assessment tools. Through such 
observations, teachers may, for example, become aware of students 
in their classes who are able to work independently and teachers 
can also identify those who require a great deal of assistance. More 
formal observations can then be planned to capture detailed infor- 
mation on these struggling students. Information from informal obser- 
vations can greatly impact the classroom instruction, as the teacher 
uses such information to plan differentiated instruction for his/her 
diverse students. Such observational information can also provide the 
basis for reports to parents via phone calls or conferences. 



structured Observations 



structured observations, unlike informal ones, usually hove o speci- 
fied focus and o specific target group (or individual). In order to col- 
lect information relevant to the focus of the observation, a teacher 
may use an observation instrument. Such on instrument is often in 
fable or matrix form with students' names listed down one side and 
particular behaviors listed across the top. For example, suppose on 
elementary science teacher has recently set up five science activity 
centers where his students con individually engage in hands-on sci- 
ence. This teacher may wish to evaluate students' progress by seeing 
if the students stay on task and if they ore able to work independently 
with the center materials. He develops o form similar to the one 
depicted in FIGURE 3.1 to use in collecting this observational informa- 
tion about students. He lists the students being observed in the space 
provided. Then, he observes these students during o 10-15 minute 
individual hands-on science activity occurring at the five centers. For 
each student, he records information about their on-tosk behavior (o 
check in the box denotes on-tosk work while o blank box means the 
student was off-task) and notes if assistance is needed or solicited. 

If he is able to observe oil five students within the 10-15 minutes and 
there is still time left over, he may perform another round (or several 
more rounds) of observation on these students. 

FIGURE 3.1 

Hands-On Science Activity Observation Form 

Dote of Observation 



Time Observation Began Time Observation 

Ended 



Hands-on Science Activity Description: 



STUDENT 

NAMES 


OBSERVATION 
ROUND 1 


OBSERVATION 
ROUND 2 


OBSERVATION 
ROUND 3 


OBSERVATION 
ROUND 4 


ON 

TASK 


ASSISTANCE 

NEEDED 


ON 

TASK 


ASSISTANCE 

NEEDED 


ON 

TASK 


ASSISTANCE 

NEEDED 


ON 

TASK 


ASSISTANCE 

NEEDED 
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Legend 

✓ in On Task box means that student was working to compiete the task when observed 
Code for Assistance Needed: N = None, S=Some, M= Much 
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The information coiiected on such observation forms couid be used 
in a variety of ways. Structured observation data often aiiows teach- 
ers to profit from new information that may chaiienge some of the 
inferences they have made about students. For exampie, before coi- 
iecting the Hands-On Science Activity data, the teacher might have 
assumed that Mai wouid have difficuity staying on task. However, 
after the first observation in September, he reaiized that Mai func- 
tioned very weii when working independentiy. The teacher found, 
however, that Aiice, Juanita, and George needed assistance to use 
the materiais appropriateiy. 

Observationai data coiiected over time can be usefui for showing 
changes in student performance. FIGURE 3.2 dispiays the data coi- 
iected about students' on-task behavior over three different observa- 
tion periods. These data show a generai pattern of improvement over 
time on independent iab work and aiso reveai which (and how many) 
students need improvement in this area. The teacher can share the 
data with students when discussing their performance in science ciass 
with them and setting goais to improve this performance. 

FIGURE 3.2 

Hands-On Science Activity 
Observationai Data Summary 



Observation Dates: September 12, January 23, and May 5 



STUDENT 


NUMBER OF TIMES OBSERVED WORKING ON TASK 


NAMES 


SEPTEMBER 


JANUARY 


MAY 


Alice 


1 


2 


3 


Mai 


4 


5 


4 


Juanita 


2 


3 


4 


Michael 


4 


4 


5 


George 


2 


4 


5 



From this data, it is easy to see that Aiice, Juanita, and George have 
made gains in on-task behavior, whiie Mai and Michaei have main- 
tained their abiiity to work independentiy. 

Besides tabies and matrices, another usefui format for recording 
observation data is the taking of anecdotal notes. Anecdotai notes 
are simpiy narratives that describe observed behaviors. Such nar- 
ratives are particuiariy appropriate to use when observing compiex 
behaviors, such as group interactions, that do not iend themseives 
easiiy to a checkiist format. For exampie, a science teacher may 
observe and describe a cooperative team probiem-soiving activity. 
The purpose of the structured observation wouid be to determine 
how students on the team contributed to the compietion of the activ- 
ity. An exampie of anecdotai notes taken by the teacher during this 
activity is dispiayed in FIGURE 3.3. 








FIGURE 3.3 

Anecdotal Notes on Group Problem-Solving Activity 



Observer: Mrs. Lee 
Time: 1:20-1:30 PM 
Date: Sept. 12 

Group Observed: Crystal, Jack, Ramon, and Anita 

Purpose of the Observation: 

To be able to describe to students how their behaviors contrib- 
uted to or detracted from the group’s efforts to solve the prob- 
lem. One of the goals for the year is the development of group 
problem-solving skills. This assessment approach documents 
student functioning relative to this goal. 

Notes: 

Crystal reminded the group that they needed to choose a 
recorder. Ramon volunteered to be the recorder and write 
things down if they told him what to write. Jack said, "What 
are we supposed to do?" Anita looked at the worksheet and 
began reading aloud the directions for the activity. Jack started 
blowing in the air and talking about wind. Crystal told Jack to 
stop playing. He looked at his sheet for a moment and then 
started blowing again. 

The first section on the worksheet asked the students to iden- 
tify the different properties of wind. Crystal told Ramon to write 
down: "the way it blows." Anita offered, "how fast it goes." The 
next section asked the students to find a way to measure one of 
the properties they had identified. Crystal said that they should 
build a weather vane to show the direction the wind blows; 
Ramon and Anita agreed. Jack didn't say anything. He was 
busy drawing a sailboat Crystal sent Jack off to the side of the 
room to get materials to build the weather vane. Jack returned 
with the materials and immediately started to put them together. 
Crystal went to the side of the room to get the things Jack forgot 
Each of the children began building their own weather vanes. 
Jack wanted everyone in the group to see his when he blew on 
it The other children began blowing on theirs. After a few min- 
utes, Crystal decided that Jack's weather vane was the best 
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These anecdotal notes constitute a written record of how the stu- 
dents worked together to solve the problem. The notes provide evi- 
dence about how each of the students contributed to the problem- 
solving activity and can help the teacher discern patterns of student 
behavior. From this brief narrative, it appears that Crystal is very task- 
oriented and is working as the group leader. Mrs. Lee (the teacher) 
can use this information to set goals for individual students or to struc- 
ture future cooperative groups. Mrs. Lee will probably target groups 
experiencing difficulties in cooperative behavior for further observa- 
tions, and the anecdotal notes from these can provide a basis for 
teacher comments and recommendations to the groups. Over time, 
a series of anecdotal notes may also help document how students 
changed the way they worked in teams. 



APPLICATION 



Reflect upon the student outcomes you wrote in CHAPTER 2 , and then examine 
your state standards. 

Identify those outcomes/standards that could be best assessed by a structured 
teacher observation. Examples may include outcomes/standards that would 
require students to act or behave in certain ways, as work cooperatively, 
organize materials, persist in work even after encountering obstacles or 
problems, etc. 

Observations may also be useful when you suspect a student has a specific 
learning disability or when you are trying to identify the auditory, visual, or 
kinesthetic learners in your classroom. 

1 . Create an observation instrument for this observation. 

2. Perform the observation. 

3. Analyze your results. 

4. Answer these questions: 

a) Did your results confirm previous inferences you held? 

b) Did the observation reveal any surprising results? 

c) What will you change about your classroom, based upon the results of 
this observation? 
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PERFORAAANCE-'BASED 
ASSESSAAENT; SOUCITING 

Information From 

STUDENT'S 



In the last chapter on observing students, several examples were 
provided of how teachers collect Information about student perfor- 
mance. A second method of collecting Information about students 
Involves the analysis of replies that students give In interviews and on 
self-reporting questionnaires. Again, the types of assessment meth- 
ods described here fall under the heading of performance-based 
assessment. In such assessment, students construct responses, rather 
than choosing responses from o given list. Performance assessments 
emphasize higher-order thinking skills (Appiication, Anaiysis, Synthesis, 
and Evaiuation) that go beyond the Knowiedge Level. 

As mentioned above, when we solicit Information from students to old 
us In assessing their performances, we can use Interviews or question- 
naires. Interviews Involve face-to-face verbal exchanges between 
the teacher and the student. In self-reporting questionnaires, students 
respond to written questions and statements. The focus of the Inter- 
views or questionnaires may be on o cognitive event (e.g., what stu- 
dents understand about a particular topic), how they feel (e.g., what 
they like or dislike about working In groups), or on personal behaviors 
(e.g.. If they talk about science topics at home or read science books 
In their leisure time). 

Interviews 

Although Individual Interviews with students ore time-consuming and 
difficult to manage In o classroom setting, there ore several reasons 
why they ore worth the effort: 

/. For those students who seem to have trouble with a particu- 
lar concept or skill (as demonstrated on assessments), inter- 
views may be a way of further assessing their functioning 
relative to instructional objectives. A series of probing ques- 
tions can be developed that would be useful in deciding 
how to help students improve their performance. 

Possible Use: Mrs. Juarez notices that Trung Is on enthusiastic 
science student who frequently asks probing questions and 
volunteers correct answers to her In-class questions. However, 

Trung Is doing poorly on written work— homework and tests. 

Mrs. Juarez Is puzzled by this disconnect between Trung's 
verbal and written work. She schedules on Interview with Trung. 

2. If a new unit is being developed, interviewing a sample of 
students of different abilities about their prior knowledge 
on the topic should allow the teacher to assess students' 
readiness to learn the new topic. Instruction could then be 
designed to target their entry level of knowledge. 




A Time This Wouid Have Heiped: In Chemistry II, Mrs. Butler 
alludes to several concepts from Chemistry I class. Not 
everyone in Chemistry II hod Mrs. Butler for Chemistry I 
lost year. Finally, one brave student raises her hand and 
confesses, "I don't know what you mean. We didn't cover this 
lost year." 

3. interviews can send a message to students that a teacher 
cares about what they think, what they are interested in, and 
what they understand. Rapport is encouraged and student 
motivation may be increased. 

A Time This Wouid Have Heiped: A team of teachers in a 
middle school plans on interdisciplinary unit on Radiation. 

The science teacher decides to focus on scenarios from 
the Cold War emphasizing the dangers of radiation from 
atomic bombs. This teacher grew up in Florida during the 
Cuban Missile Crisis, so this topic is highly relevant to her. 

When she teaches the unit, however, she notices that the 
students ore simply "going through the motions." They ore not 
interested in the subject at oil. What was highly interesting 
and motivational for her does not hove the some effect 
on her students. Interviewing the students to find out what 
would interest them in this area could hove increased student 
motivation to learn. 

4. interviews aiiow students who have difficuity with written tests 
to express what they understand in a context that may be 
iess threatening and anxiety producing. On the flip side, 
students who do well on written tests may have difficulty 
communicating their responses to verbal questions and may 
need practice in doing so. 

Possible Use: Morlee flunks every written test in science, yet 
seems to know the science material. Mr. Chapman schedules 
on interview with Morlee before the next test in order to 
probe her knowledge of test items and to hove her read and 
explain o sample passage from the text in order to explore 
any reading difficulties she is experiencing. 

5. Interviews provide teachers the opportunity to probe and ask 
follow-up questions in ways that challenge students to think 
beyond their current level of understanding and to organize 
their knowledge in more systematic ways. Thus, follow-up 
questions can be individualized such that students are 
pushed as far as their level of understanding permits. 

Possible Use: Mario is the Jeopardy "queen" in Mrs. Sicco's 
science class. Every time this middle school class ploys the 
review Jeopardy gome, Mario wins. She knows ALL the 
facts. Flowever, Mrs. Sicco notes that Mario does poorly 
on questions that ask her to synthesize knowledge and 
that Mario consistently receives poor marks on student- 
constructed concept mops. Mrs. Sicco schedules on 
interview with Mario to help her formulate connections 
among facts by helping her organize and categorize her 
knowledge on o topic. 

6. One common student outcome in science courses is that 
students will learn to communicate effectively. If science 
teachers promote this goal, interviews are clearly an 



assessment method of choice. That is, students shouid not 
oniy be assessed with written tests, but aiso shouid be asked 
to express what they know verbaiiy. 

Possibie Use: David is one of Mr. Chang's students in 6th- 
grade science cioss. He maintains average to high scores on 
aii written assignments, yet he roreiy speaks in ciass. in fact, 
he is so quiet, he couid easiiy become one of those students 
who "siip through the cracks." His scores and his behavior 
in cioss are not iow enough or negative enough to warrant 
intervention or attention. He couid pass through Mr. Chang's 
cioss without ever making o connection with Mr. Chong. 

An interview might reveoi ways to get David more active 
verbaiiy and reveai ways he couid improve his performance 
in science, in addition, an interview couid reinforce for David 
that practicing his verbai expianotions is os important os 
practicing written ones. 

interviews, os the ones described above, can vary in their degree 
of structure, in unstructured interviews, the content and order of the 
questions vary with the student and ore responsive to each student's 
answers. The exampie of Mrs. Juarez and Trung wouid exempiify such 
on unstructured interview. Mrs. Juarez is truiy puzzied by the differ- 
ences in Trung's written and ciass behaviors. She may simpiy begin the 
interview by pointing out the discrepancies and then key any further 
remarks on Trung's responses. Such unstructured interviews aiso occur 
any time the teacher and o student share personoi dioiogue, as when 
the teacher stops by a student desk when circuiating. Such mini-inter- 
views occur spontaneousiy, on o daiiy basis, and ore used by teach- 
ers to assess students' competence reiative to instructionai exompies. 

in semi-structured interviews, there may be some themes identified 
to structure the interviews, but questions within those themes may be 
phrased differentiy for different students. For exampie, in the David/ 
Mr. Chong scenario above, Mr. Chong may impiement a semi-struc- 
tured interview. The "theme" for the interview wiii be David's verbai 
behavior, in this interview, Mr. Chang wiii iook for ways to increase this 
behavior; in interviews with other students, Mr. Chong may be iooking 
to curtaii such behavior! 

in structured interviews, teachers ask students to respond to the some 
set of questions. The Radiation Unit exampie described above wouid 
have benefited from structured interviews of students. The students 
couid have responded to the some set of questions about their inter- 
est in radiation. The middie schoo! science teacher couid then have 
pianned o more motivating unit on radiation for these students. 

An interview may, at times, substitute for o written test when the 
teacher wishes to determine what students know, if a teacher wonts 
to give students an opportunity to be interviewed on their under- 
standing of a topic rather than taking a quiz on this information, the 
set of questions shouid be simiiar for aii students choosing this option. 
Therefore, this is another use of structured interviews. 



Collecting Usable Data from Interviews 



An interview, iike an observation, is experience-based, it occurs 
quickiy, and then it is simpiy over, in order to effectiveiy use interviews, 
some means of capturing data is necessary. Trying to take onecdotai 
notes during on interview is difficuit and may actuaiiy interfere with 
the purpose of the interview, if the teacher is using the interview to 
demonstrate caring, attention given to note-taking instead of atten- 
tion given to the student wiii send the wrong message. Aiternotiveiy, 
interviews con be oudiotoped. The teacher con then iisten to the 
audiotape ioter to form inferences or conciusions or to write heipfui 
anecdotoi notes. 

An interview instrument (very simiior to an observation instrument) may 
aiso be deveioped prior to the interview. Aii structured interviews in 
which oii students respond to the some set of questions wiii invoive the 
use of such a data-capture instrument. Simiiariy, in the case of Mariee 
and Mr. Chapman above, Mr. Chapman couid bring a iist of science 
concepts to the interview to discuss with Mariee. Based upon Moriee's 
responses, Mr. Chapman couid rate her knowiedge of the concepts 
from poor to exceiient. He couid then heip Mariee devise strategies to 
improve her performance on the "pooriy" rated concepts. 

in effect, Mr. Chapman is giving Mariee on orai exam, rather than 
a written one. Such oroi tests may give those students who have 
poor iiterocy skiiis a chance to succeed, in addition, this assessment 
method provides the teacher with assurance that students under- 
stand the test question. Converseiy, written exams make the assump- 
tion that students understand the questions asked. Another advan- 
tage of oroi exams is oiso reported by some teachers. They report 
that students take oroi tests more seriousiy because they feei such 
tests are more personai expressions of competence than a written 
test wouid be. Students may prepare more carefuiiy if they know they 
must stand before o teacher and answer questions individuoiiy. 

FIGURE 4.1 provides on exompie of an orai exam on the three phases 
of water. This exam couid be considered o structured interview, 
in that the same set of questions is used with oii students. A rating 
scaie for answers is provided for each question, ensuring effec- 
tive dato-copture. Students respond to the question, the teacher 
records the students' answers, and then rotes these answers by 
assigning point vaiues. 



FIGURE 4.1 

Oral Exam on the Three Phases of Water 

Student’s Name: 

Date: 



SCORING KEY 


POINTS 

AWARDED 


QUESTIONS 


1 point for each phase 
identified correctly 
(ice, water, steam) 




1. What are the three phases of water? 


0 = Incorrect 

1 = Partially correct 

2 = Satisfactory 




2. Describe each of the three phases: 

a) Ice 

b) Liquid 

c) Steam 


0 = Incorrect 

1 = Partially correct 

2 = Satisfactory 




3. What happens when water goes from one 
phase to the other: 

a) Ice to liquid? 

b) Liquid to Ice? 

c) Liquid to Steam? 

d) Steam to Liquid? 


No rating 




4. Is there anything you do not understand 
about water phases? 



Total Points Awarded = (Maximum is 17) 

V ) 



The results of the oral exam displayed in FIGURE 4.1 could be used in 
a number of ways. Students who had less than 17 points could be 
assigned a peer coach who scored all 17 points on the exam. This 
peer coach could work on the questions with the students until he or 
she was ready to retake the exam. The second administration could 
result in a score entered into the grade book. 

No matter what type of interview protocol is chosen (unstructured, 
semi-structured, or structured), it is important to obtain usable and 
useful data from the interview. This is particularly important in that 
interviews are very time-consuming. The teacher must ensure that the 
interview is actually worthy of the time commitment. The suggestions 
for interviews found in FIGURE 4.2 may be helpful in this area. 





FIGURE 4.2 

Suggestions for Interviews 
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1 . Use sampling techniques to choose participants for your 
interviews. In other words, if o small sample of your students 
con provide the information you require, don’t try to 
interview oil the students. 

2. In one school year, try to ensure that every student in your 
class participates in at least one interview with you. 

3. Keep the tone of the interviews positive and constructive. 

Try not to give verbal or facial expression cues that con be 
interpreted os meaning that on answer is silly or that the 
student has mode on error. 

4. Let students respond without interruptions, and give them 
time to think before they respond. (Remember Wait Time 
One and Wait Time Two. Wait Time One means waiting at 
least 5 seconds after you pose o question before calling 
on o responder. Wait Time Two reminds teachers to wait at 
least five seconds after the student has responded before 
speaking.) 

5. Try to keep interviews short and focused on truly important 
questions. 
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APPUCATION 



1 . Review the six examples of interviews 
described earlier in this chapter. Choose 
one interview type that you would like to 
implement in your class (structured, 
semi-structured, or unstructured). Create 

o doto-copture instrument for this interview 
and implement. 

2. Reflect on the students in your class. 

Would on interview with any of your 
students be useful to you in assessing their 
performances? Moke o list of students who 
might benefit from on interview. 

3. Strotegize ways to schedule student 
interviews (what will other students do 
while some ore interviewed?). 

V 



Self-Assessment Questionnaires 

Every assessment tool has advantages and disadvantages; some 
serve a particular purpose better than others. Student self-assess- 
ment questionnaires may be helpful in determining how students 
perceive their own knowledge, skills, or the quality of their work. Such 
questionnaires may also reveal concerns students have about their 





academic progress, their prior ievei of experience with o topic or skiii, 
their feeiings about the ciass, and their interest in science os o career. 
Questionnaires ore aiso exceiient methods to use if teachers wont 
to see the ciassroom through the eyes of o student, it is often usefui 
to compare student perceptions to teacher perceptions in order to 
understand how ciassroom instruction and assessment procedures 
con be improved. 

When used oppropriateiy, seif-ossessment questionnaires activeiy 
invoive students in reflecting on their own ieorning processes (promo- 
tion of meto-cognition) and emphasize the importance of students' 
awareness about what they know and what they need to know. 
Therefore, seif-assessment questionnaires certoiniy foii within the 
parameters of performance-based assessment, in that they promote 
student cognition beyond the Knowledge ievei. These question- 
naires ore in o specioi category of performance-based assessment, 
however, in that they usuaiiy contain a mix of seiected response 
and constructed response items. Most of the other performance- 
based assessment methods we have previousiy discussed in this 
monuai reiied on constructed response items oniy. Figure 4.3 dispiays 
one exampie of a seif-ossessment questionnaire that uses o mix of 
seiected (see question 1) and constructed response items. 



FIGURE 4.3 

Science Skills Self-Assessment 




Directions: Read the questions and statements below and then answer each as 
best as you can. There are no right and wrong answers. 

1. How would you rote your interest in science right now? 



□ Very High □ High □ Medium □ Low □ Very Low 

2. What did you like the most about science last year? 

3. What did you like least? 

4. Put a check by each instrument you have used. Beside each instrument, 
describe briefly what it does. 

□ Microscope 

□ Weight scale 

□ Thermometer 

□ Weather vane 

□ Ruler 

□ Barometer 

□ Compass 

□ Rain gauge 

5. What do you like or dislike about working with a team of students? 









The self-assessment questionnaire displayed in FIGURE 4.3 is one that 
a teacher might give to students at the beginning of the year to bet- 
ter understand their science background and interests. In administer- 
ing the questionnaire, the teacher might show the students each of 
the instruments listed in Question 4 so that students who knew how 
to use the instrument but had forgotten the name of the instrument 
could respond. 

The teacher could use the assessment results in several ways. First, 
the teacher may wont to summarize the frequency of responses to 
the interest question (QUESTION 1) os a baseline for comparison to 
responses on this some question at the end of the year. Summarizing 
the responses to the instrument question (QUESTION 4) in o frequency 
chart (instruments by number of students who hod used each and 
could describe the function of each) could assist the teacher in judg- 
ing how much remediation was needed. If cooperative learning skills 
were to be a focus for the year, the names of students who indicated 
dislikes about working in o team (QUESTION 5) could be listed, and 
anecdotal notes kept about any difficulties they hod when teamwork 
was initiated. 

Students con also be queried via self-assessment questionnaires on 
their understanding of science concepts. Yager and Kellermon (1992) 
note that o teacher might list the topics to be covered over a period 
of time (e.g., carbohydrates, concentration, starch, glucose, diges- 
tion). They suggest that students could be asked to rate each con- 
cept using the key found in FIGURE 4.4. 

FIGURE 4.4 

Key for Rating Understanding of Science Concepts 



NUMBER 


STATEMENT 


1 


1 have never heard of it. 


2 


1 have heard of it but do not understand it. 


3 


i think i understand it partiaiiy. 


4 


i know and understand it. 


5 


i can expiain it to a friend. 



Questionnaires on understanding of science topics do not necessarily 
hove to be this formal or even written. When o new topic is presented 
in class, teachers con issue students three pieces of colored paper 
(green, yellow, and red). As on explanation of the new topic pro- 
gresses, the teacher con stop periodically and ask students to hold 
up the appropriate piece of colored paper. Green means "Keep 
going. I'm with you." Yellow means "I'm o little confused. Please 
explain this again or in o different way." and Red means "StopI 
You've lost me." 

Such checks on student perceptions con inform the teacher of com- 
prehension problems immediately— os they ore happening. This pre- 
vents the teacher from simply assuming that oil students understood 
the topic of the day. Self-assessment questionnaires of the above 
two types (written assessments in which students use o rating scale 
and in-class assessments using colored papers) ore often perceived 





by students to be less threatening than a pre-test or comprehension 
quiz. They can give students a sense of the different levels of knowing 
(meta-cognition) if used frequently in a class situation. 



APPUCATION 



Design one self-assessment questionnaire that would be useful to your 
class. This con be a questionnaire soliciting prior knowledge, present level 
of knowledge, or student feelings/beliefs. Complete the questionnaire 
yourself and then compare your answers to the summarized responses of 
your students. Answer the following questions: 

• What surprised you about the student results? 

• How con the information you garnered through this questionnaire help 
improve instruction or assessment procedures in your classroom? 

• Did you discover any student misconceptions? 

V y 





PERfORilAANCE-BASED 

ASSESSflAENT; 

Bvkwkvng 
Student i^or< 

In keeping with the title of this publication, we have focused primarily 
on assessment methods that go beyond the multiple-choice test. We 
have emphasized performance-based assessment in the past two 
chapters, and we continue this emphasis in CHAPTER 5. To this end, 
this chapter covers the following types of performance-based assess- 
ments: open-ended questions, performance tasks, logs, journals, 
portfolios, and exhibitions/projects. These assessments all involve the 
evaluation of student work. Such work is often tangible, as products 
are created (written answers to questions, log entries, journal entries, 
portfolio artifacts, science backboards, formal lab reports, etc.) 
Sometimes teachers must evaluate intangible student work, as oral 
presentations, student demonstrations, re-enactments, debates, etc. 
The purpose of this chapter is to offer suggestions for implementing 
both tangible and intangible assessments that involve teachers in 
evaluating student work. 

Open-Ended Questions 

Rather than having students select a response, open-ended ques- 
tions ask students to produce a response. The length of the responses 
can vary considerably based upon the age of the student, the 
question asked, and the time provided to complete the question. 
Open-ended questions, like other performance-based assessment 
methods, require students to use higher-order thinking skills and there- 
fore exercise more complex cognitive processes than simple multiple- 
choice questions. Gronlund and Linn (1990) found that open-ended 
questions particularly tapped into such high cognitive processes 
when students were asked to respond to the types of question starters 
found in FIGURE 5.1. A level of Bloom's Taxonomy is matched to each 
of these "starters" in FIGURE 5.1, showing how the question targets 
higher-order thinking skills and a sample science question is also dis- 
played in this figure. 





FIGURE 5.1 

Open-Ended Questions Requiring 
High Cognitive Processes 



STARTER FOR THE 
QUESTION 


SAMPLE SCIENCE QUESTION 


BLOOM’S LEVEL 


EXPLAIN A CAUSE-EFFECT 
RELATIONSHIP 


Why may too-frequent reliance on 
penicillin for the treatment of minor 
ailments eventually result in its diminished 
effectiveness against major invasion of 
body tissues by infectious bacteria? 


Analysis 


DESCRIBE AN 
APPLICATION OF 
A PRINCIPLE 


Would you weigh more or less on the 
moon? On the planet Jupiter? Explain. 


Application 


FORMULATE A QUESTION, 
HYPOTHESIS, OR 
A CONCLUSION 


What questions should o scientist ask in 
order to determine why more smokers 
than nonsmokers develop lung cancer? 


Synthesis 


DESCRIBE THE 
LIMITATIONS 
OF THE DATA 


In this class, we conducted o survey 
concerning school uniforms. Are we 
ready to moke o report to the school 
board on how our middle school feels 
about this issue? Why or why not? 


Evaluation 


EXPLAIN A METHOD OR 
PROCEDURE 


One of the big ideas in physics is Newton’s 
Third Low. State this low, explain its 
meaning, and give one real-life example 
(other than those used in the text or 
discussed in class) of this low in action. 


Application 


INTEGRATE LEARNING IN 
DIFFERENT AREAS 


Using the human and wildlife population 
density mops below, moke o recommen- 
dation about where to locate the new 
airport. Remember to preserve as many 
wildlife habitats os possible. 


Synthesis 


CREATE OR DESIGN 
SOMETHING (I.E., AN 
EXPERIMENT) 


Devise on invention that mokes on 
everyday task easier to accomplish. 
Use (and label) at least three simple 
machines on your design. 


Synthesis 


EVALUATE THE WORTH OF 
AN IDEA 


The Florida Fish and Wildlife Commission 
is debating lifting the bon on alligator 
hunting in the state. Present the pros and 
cons of such o change in state policy. 


Evaluation 



As we have seen in FIGURE 5.1, open-ended questions con stimuiote 
the use of higher-order thinking skiiis. Such compiex open-ended 
questions con oiso heip assess o variety of instructionoi goois, inciud- 
ing conceptuoi understanding, appiication of knowiedge, the use of 
science process skiiis, and divergent thinking skiiis. Exompies of how 
open-ended questions con be used with each of these instructionoi 
goois ore dispioyed in FIGURE 5.2. 






FIGURE 5.2 



INSTRUCTIONAL 

GOAL 


QUESTION 


EXPLANATION 


CONCEPTUAL 

UNDERSTANDING 


How would life and the conditions on 
earth be different if all bacteria and 
fungi became extinct? Explain the 
changes that might occur and give as 
much detail as possible (Grade 8). 

Source: Open response released item 
(1991-1992), Kentucky Instructional 
Results Information System. Kentucky 
Department of Education. Division 
of Curriculum, Assessment, and 
Accountability, Capital Plaza Tower, 
Frankfort, KY 40601. 


The question asks students to access 
background knowledge, organize 
and express ideas in their own words, 
and generate a report on a question. 
All of these activities tap into the con- 
ceptual understanding the student 
has for the interdependence of life 
on Earth. 


APPLICATION OF 
KNOWLEDGE 


Using the weather map displayed 
on this page, make a forecast for the 
weather in North Carolina for the next 
day. Explain why you made the fore- 
cast (Grade 6). 

Source: Open response released item 
(1991-1992), Kentucky Instructional 
Results Information System. Kentucky 
Department of Education. Division 
of Curriculum, Assessment, and 
Accountability, Capital Plaza Tower, 
Frankfort, KY 40601. 


Before answering this question, stu- 
dents have used weather maps and 
worked on weather predictions. They 
have learned the symbols associ- 
ated with fronts, various types of 
precipitation, isobars, etc. and stud- 
ied how each may affect a region’s 
weather. They are now being asked 
to apply this knowledge to a new 
(previously unseen) weather map. 


USE OF SCIENCE 
PROCESS SKILLS 


You are a state scientist asked to 
develop an experiment to determine 
whether discharge from a factory 
is endangering Kentucky Lake 
(Grade 12). Identify several possible 
consequences of the discharge. 
Choose one of the consequences and 
design an experiment to investigate 
whether the consequence is actually 
occurring and if it is caused by the 
discharge. Describe how you would 
investigate, the kinds of data you would 
collect, and what you would do with 
your data. 


Here, students will need to know 
the integral parts of a scientific 
investigation and how to apply this 
knowledge to an actual problem/ 
question. 




Source: Open response released item 
(1991-1992), Kentucky Instructional 
Results Information System. Kentucky 
Department of Education. Division 
of Curriculum, Assessment, and 
Accountability, Capital Plaza Tower, 
Frankfort, KY 40601. 




DIVERGENT THINKING 
SKILLS 


Suppose there were no more disease 
in the world. List as many possibilities/ 
consequences as you can for what 
might happen in the future as a result 
of this. 

Source: Assessment Ideas for Science in 
Six Domains (1992). Robert E. Yager and 
Lawrence R. Kellerman (Eds.). Science 
Education Center. Van Allen hall. 
University of Iowa, Iowa City, Iowa 52242. 


Divergent thinking requires 
students to create multiple, original 
approaches to problems. Scientists 
use divergent thinking to generate 
research questions and hypotheses 
and to develop plans of action. Here, 
students are asked to consider the 
consequences of an action and 
use their background knowledge as 
well as divergent thinking to list the 
possible consequences. 






APPUCATION 



1 . Develop open-ended questions for the next unit of 
study you will teach. Develop one question for each 
of these levels of Bloom’s Taxonomy: Application, 
Analysis, Synthesis, and Evaluation. 

a. Consider the instruction that students will need in 
order to be ready to answer these questions. 

Incorporate this instruction into your unit plan. 

b. Consider the practice in answering open-ended 
questions your students will need before such 
questions con become port of on assessment that 
counts os a grade. Incorporate this practice on 
answering open-ended questions into your unit plan. 

2. Match the opened-ended questions for your next unit 
to particular learning goals (from notional or state 
standards). Did you use some goals not listed in 
Figure 5.2? What were they? Shore these goals with 
their accompanying open-ended questions with a 
colleague. Ask the colleague to comment on the 
match between the goal and the question. In other 
words, would this question actually help determine 
student progress toward the goal? 

V / 

Student Responses to Open-Ended Questions 

If open-ended questions are to be included on a test that will be 
graded, it is important for teachers to prepare students for this task. 
After many years of only encountering multiple-choice testing, some 
students may have difficulty with open-ended questions. Students 
will need in-class practice on writing answers to open-ended ques- 
tions; they will need feedback on their practice performances; and 
they will need to understand the criteria that will be used to judge 
their responses. 

At first, student responses to open-ended questions may be short, 
somewhat incoherent, and not well developed. It may be difficult to 
judge their understanding of the concept, simply because they do 
not possess sufficient communication skills to convey their thoughts. 

To aid students in developing such skills, the teacher can use sev- 
eral techniques. For example, the teacher can pick the best student 
responses to read aloud to the class and then ask the class to critique 
their own responses in terms of whether or not they met the standard 
exemplified in the example read aloud. She may ask the class to 
articulate the criteria they are using to judge their own work, based 
on what they heard in the example. A class compilation of criteria, 
along with explanations of each criterion would be helpful in defining 
the quality expected in the responses. 

Students will need practice in incorporating these quality criteria into 
their own writing. Therefore, no grades should be taken until the sec- 
ond or even third administration of open-ended questions or until it is 
clear that students have had ample opportunities to understand the 
expectations. 





Grading open-ended questions invoives interpreting the quaiity of 
the response in terms of cieariy orticuiated criteria. FIGURE 5.3 dispiays 
severoi open-ended questions reiated to the apparent motion of the 
sun oiong with o rating scoie to use in assessing the quaiity of stu- 
dent responses. Criteria for the responses inciude scientific accuracy 
of the expianotion/description and the coherence of the response. 
Distinguishing between o score of 2 (accurate but not weii written) 
and o 3 (accurate and weii written) may heip to impress upon stu- 
dents the importance of structuring their responses so that they are 
coherent to the reader. 

FIGURE 5.3 

Open-Ended Questions With 
a Rating Scoie For Responses 

Questions: 

1. Why do we use the term “the sun’s apparent motion”? 

2. If we agree that the sun is not really moving across the sky, 
what is happening to make it look that way? 

3. At 9:00 AM, a shadow is west of a tree; at 4:00 pm, the shadow 
is east of the tree. Explain why this happens. 

4. Why do people in North Carolina see the sunrise before 
people in California? 

Rating Scale: 

0 - Incomprehensible/inaccurate explanation 

1 - Provides partially accurate explanation 

2 - Provides accurate explanation but not well written 

3 - Provides very well-written and accurate explanation 

) 

Source: Rita Elliot A.G. Cox Middle School, Pitt County Schools, Winterville, NC. 

Before impiementing open-ended questions in the ciassroom, the 
teacher wouid be advised to utiiize the foiiowing suggestions: 

• Be ciear about the purpose of such questions. What 
instructionai goais wiii they heip you assess? For exampie, in 
FIGURE 5.3, the instructional goal targeted by the question 
may be "students will be able to explain phenomena 
relevant to the Earth/sun system." 

• Answer the questions yourself before administering them 
to students. This will help you clarify your own expectations 
regarding an ideal student response. 

• Develop a rating scale or point system to use with the 
questions. Share this rating scale with students before they 
begin to work. (More information about developing such 
grading schema is included in CHAPTER 6.) 

• Read over a sampling of answers before you grade them. This 
will help you get an idea of the range of responses present 
for each question. It may be helpful to sort the responses 



into piles based on the rating soale being used (all the ones 
together, all the twos together, etc.) before assigning a final 
scale value to the response. 

The rating scale used in FIGURE 5.3 used scientific accuracy and 
coherence as criteria forjudging student responses. These crite- 
ria define what the teacher is expecting— what is being assessed. 
However, other assessments are possible. For example, student 
responses to open-ended questions can be analyzed to identify 
misconceptions or problems in understanding a concept. Rather 
than grading such questions, the teacher can choose to group the 
responses into categories of similar answers so that remedial instruc- 
tion can respond to the kinds of errors being made. 

Performance Tasks 

Although many achievement objectives can be assessed with 
paper-and-pencil tests, there are other objectives that require stu- 
dents to actually demonstrate their competence. In some situations, 
given the purpose of the assessment (e.g. licensing people to drive 
cars), a performance test is necessary. It would be unthinkable (and 
dangerous!) to license people to drive on the strength of a written 
test on driving rules. Likewise in science instruction, there may be 
some skills (science investigation skills, skills in using science equip- 
ment) that are most appropriately assessed by having students per- 
form tasks rather than take pencil-and-paper tests. 

Such performance tasks can be used to assess a variety of instruc- 
tional goals. Consider the Electric Circuits Performance Task 
described in FIGURE 5.4. This task has the ability to assess: 

1. ) Students' abilities to manipulate science equipment (in order 

to create working electrical circuits) 

2. ) Students' conceptual understandings of the two types of 

circuits (series and parallel) 

3. ) Students' abilities to self-assess and correct errors (checking 

to see if circuits work) 

4. ) Students' abilities to organize thoughts and express ideas 

coherently (writing answers to questions) 

5. ) Students' abilities to classify and analyze (relating how one 

circuit is different from another) 

6. ) Students' abilities to evaluate or choose alternatives 

(explaining why one particular difference is the most 
important) 

Would a pencil-and-paper test have been able to accurately assess 
all these student abilities? It is difficult to see how such a test could 
address numbers 1 and 3 above. 



FIGURE 5.4 

Electric Circuits Performance Task 



TASK: 

Your job is to draw two circuits. One is a series circuit, and the 
other is a paraiiei circuit. Each circuit has one battery, wire, a 
switch, and two iight buibs. To prove that your drawings are cor- 
rect, use the materiais in your science kit to nnake each circuit 
the way you have drawn it. When you connpiete drawing and 
nnaking the two circuits, you wiii answer these two questions: 

1 . What is the one important difference between a series 
and a paraiiei circuit? 

2. Why do you think that difference is the most important 
difference? 

PROCEDURE: 

1 . Review the assessment criteria for the Eiectric Circuits 
Performance Task (see Figure 5.5). 

2. Draw the series circuit. Use arrows to show the path of 
the eiectricity in the circuit. 

3. Make the series circuit you have drawn. 

4. Draw the paraiiei circuit. Use arrows to show the path of 
the eiectricity in the circuit. 

5. Make the paraiiei circuit you have drawn. 

6. Answer the two questions. 

V 

Source: Hibbard, M.K. (2000). Performance-based learning and assessment in middle 
school science. Larchmont, NY: Eye on Education, p. i22. Source materiai has been 
abridged. 



If science classes are to be about doing science, rather than just 
reading about science, then the use of performance tasks as the one 
shown in FIGURE 5.4 represents a better match to the overall instruc- 
tional goal. 

The rating scheme for the Electric Circuits performance task is dis- 
played in FIGURE 5.5. Depending on the purpose of the assessment, 
there are many different ways to judge how well students performed 
on the task. 



FIGURE 5.5 

Rating Scheme for the Electric Circuits 
Performance Task 



PORTION OF THE TASK 


TERRIFIC 


OK 


NEEDS WORK 


DRAWING THE 
SERIES CIRCUIT 


Drawn correctly so 
that it would work as a 
series circuit. Drawing 
is neat, organized, 
clear, and large. 


Drawn correctly so 
that it would work. 
Drawing is not clear or 
neat enough. 


Drawn so that it would 
not work as a series 
circuit. 


CONSTRUCTING THE 
SERIES CIRCUIT 


Circuit is made so 
it works as a series 
circuit. Circuit corre- 
sponds completely to 
drawing and uses the 
appropriate materi- 
als/quantities. 


Circuit is made so that 
it works, but the con- 
struction only partially 
matches the draw- 
ing or only partially 
conforms to required 
materials list. 


Circuit does not work 
as a series circuit. 


DRAWING THE 
PARALLEL CIRCUIT 


Drawn correctly so 
that it would work as 
a parallel circuit. 
Drawing is neat, 
organized, clear, 
and large. 


Drawn correctly so 
that it would work. 
Drawing is not clear or 
neat enough. 


Drawn so that it would 
not work as a parallel 
circuit. 


CONSTRUCTING THE 
PARALLEL CIRCUIT 


Circuit is made so it 
works as a parallel 
circuit. Circuit corre- 
sponds completely to 
drawing and uses the 
appropriate materi- 
als/quantities. 


Circuit is made so that 
it works, but the con- 
struction only partially 
matches the draw- 
ing or only partially 
conforms to required 
materials list. 


Circuit does not work 
as a parallel circuit. 


WRITTEN ANSWER 
TO QUESTION ONE 


Reason given clearly 
shows how the circuit 
wiring differs between 
the series and paral- 
lel circuit and refers 
to accurate drawings 
of the circuits to justify 
this reason. 


Reason given clearly 
shows how the circuit 
wiring differs between 
the series and parallel 
circuit. 


Reason given 
does not address 
differences in circuit 
wiring. 


WRITTEN ANSWER 
TO QUESTION TWO 


The answer contains 
a reference to the dif- 
ferences in the path 
of the electricity in the 
two different types of 
circuits and how this 
would affect the light- 
ing of the bulbs. 


The answer contains 
a reference to the dif- 
ferences in the path 
of the electricity in the 
two different types of 
circuits. 


The answer does not 
contain a reference to 
the difference in the 
path of the electric- 
ity in the two different 
types of circuits. 



Source: Hibbard, M.K. (2000). Performance-based learning and assessment in middle 
school science. Larchmont, NY: Eye on Education, p. 123-124. Source material has 
been abridged and adapted. 



It is critical to be clear on the elements or features of a desired, strong 
performance. This rating scheme appears to emphasize the following 
elements of the task: 

• Accuracy of drawings 

• Neatness of drawings 

• Accuracy of construction 







• Correlation between drawing and construction 

• Accuracy of differences between series and parallel circuits 

• Justification of answers 

These criteria are closely associated with, and accurately match the 
instructional goals addressed by the task. Therefore, this task has the 
potential to provide the teacher with valid assessment data relevant 
to the instructional goals. 

Consider the following in implementing performance tasks in your class: 



• Determine if a performance task is truly the best way to 
assess the learning target. For example, if basic knowledge is 
all that is required by the learning target, a multiple-choice 
question will be much more efficient in assessing this. Use 
performance tasks to measure instructional goals that call for 
a demonstration of abilities. For example, use performance 
tasks to demonstrate that students can actually "do science." 



Align the directions/procedures students will follow to the 
instructional goal/learning target. If you wish students to 
demonstrate lab safety skills, be sure that the procedures 
call for them to actually work in the lab (not just write about 
doing so). 

Align the rating scale to the instructional goal/learning tar- 
get. For example, on the lab safety skills performance task, 
the rating 



RESOURCES 



The following books may provide you with some ideas for creating 
performance tasks: 

Bosok, S.V. (2000). Science is... A sourcebook of fascinating facts, 
projects, and activities. Markham, Ontario, Canada: Scholastic 
Canada, Ltd. 

Center for Performance Assessment. (2001). Performance 

assessment series, Ciassroom tips and toois for busy teachers, 
Eiementary schooi edition. Englewood, CO: Advanced Learning 
Centers, Inc. 

Center for Performance Assessment. (2001). Performance 

assessment series, Ciassroom tips and toois for busy teachers. 
Middie schooi edition. Englewood, CO: Advanced Learning 
Centers, Inc. 

Glatthorn, A. A. (1998). Performance assessment and 

standards-based curricuia: The achievement cycie. Larchmont, 
NY: Eye on Education. 

Hibbard, M. K. (2000). Performance-based teaming and assessment 
in middie schooi science. Larchmont, NY: Eye on Education. 

Rezba, R.J., Sprague, C., Fiel, R.L, & Funk, H.J. (1995). Learning and 
assessing science process skiiis, Third Edition. Dubuque, Iowa: 
Kendall/Hunt Publishing Company. 



scale may 
include such 
criteria as 
"wore safety 
glasses," 

"accurately 
followed 
directions," 

"used equip- 
ment appro- 
priately," etc. 

Prepare 
students for 
performance 
tasks by 
allowing 
them to 
practice such 
tasks before 
grading them. 

Give students 
plenty of 
opportunities 
for feedback 
on their per- 
formances, 
by having 
students self- 

assess and peer-assess using the rating scale before you 
assess them with this same scale. 




The following list of performance task examples may help stimulate 
your thinking about ways you might implement this type of assess- 
ment in your science classes: 

Use the equipment and materials provided to make the 
necessary measurements to calculate the density of each 
material. 

Create a working electrical circuit. Use this circuit to test 
the items in the bag. Report if each item is a conductor of 
electricity or a nonconductor. 

Ask one member of your group to step into the pan of lime 
chalk (the type used to mark lines on the football field) and 
then a) walk normally and b) run, leaving lime footprints on 
the asphalt of the parking lot. Measure this student's height. 
Determine a relationship between height and stride that 
might be useful to a detective at a crime scene. 

Using the stream table, demonstrate the creation of an ox- 
bow lake. 

From the genetic information provided to you, construct your 
creature, ensuring that this creature has the appropriate 
number of legs, eyes, body segments and appropriate 
colors of body and eyes, and appropriate shapes and sizes 
of antennae and tails. Pair with another student and "mate" 
your creatures. Construct models of all possible children from 
this pairing. 



APPMCATION 



Choose one of the performance task examples stated 

above (or moke up one of your own). Then: 

• Describe how this task aligns with a notionoi or state 
standard (or standards). 

• Expioin why a performance task would be the best 
way to assess this standard (or standards). 

• Deveiop procedures/student directions for the 
performance task that aiign with the standard (or 
standards). 

• Create a iist of criteria you wouid use to judge student 
achievement of this performance task. Expiain how 
these criteria aiign with the standard (or standards). 

• Describe the types of practice you wiii provide to 
students before grading the task. 

• Describe how individuai students wiii receive 
feedback on their progress. 

V / 



Logs and Journals 

Open-ended questions and performance tasks are ways to assess 
student learning at a particular point in the instructional process. Like 
these two assessment methods, logs can also be used periodically 





to assess particular student actions or learning activities. Conversely, 
journals are dynamic (ongoing, continuous) assessment approaches 
that promote communication between the teacher and students, 
allow students to reflect on what they are learning, and foster stu- 
dents' active involvement in classroom activities. 

Logs 

A log provides documentary evidence of events and may also show 
the progression of such events. Students may be asked to keep scien- 
tific logs while running science experiments. The addition of growth 
factors to plants, as well as recorded heights of the plants at speci- 
fied intervals are examples of data included in such logs. A detailed 
log can also help convince a teacher that, indeed, the student per- 
formed certain actions. In addition, it can reveal the exact nature of 
those actions. Because of its documentary properties, logs are fre- 
quently utilized to support student assertions or conclusions. They are 
commonly used within science fair experiments to document actions 
that students took in solving problems. The advantages logs bring to 
assessment include the following: 

• They promote the achievement of instructional goals related 
to the nature of science. (Science is based on evidence; 
science findings are open to review.) 

• They provide a track record, showing exactly what the 
student did and did not do. 

• They help identify misconceptions and misunderstandings. 

• They assist students in analyzing their own work. (If something 
doesn't work, students can track their own progress and then 
make necessary changes or corrections to ensure success.) 

Journals 

Journals are similar to logs, in that they provide a record of the pro- 
gression of events. Generally, journals do not have the legalistic, 
evidentiary purpose of a log. While a journal documents events, it fla- 
vors those events with the opinions, feelings, and perceptions of the 
author. This "flavoring" of data with the consciousness of the author 
is what makes journals so useful to teachers. For example, a teacher 
may ask her science students to record "what you learned today." By 
reading the journals, the teacher can ascertain not only which con- 
cepts were conveyed to students, but also the level of understanding 
of the concepts achieved by her students. Such a journal entry would 
also encourage student meta-cognition as they assess their own lev- 
els of understanding. 

Checking comprehension is only one use of journals. Journals can 
also be used to foster student reflection and critical thinking skills. 
Students can write journal entries as they begin to tackle a science 
problem, recording the question to be investigated and their own 
predictions. After the science investigation, students can revisit their 
predictions, explain why their predictions were or were not accurate, 
and reflect on the meanings or understandings they have derived 
from the investigation. Such reflections ask students to analyze their 
own thought processes and emphasize what changes in thinking 
have occurred. Such information on thinking changes is invaluable to 
the student and teacher. 



Often, teachers provide prompts in order to encourage student 
writing in journais. The prompts iisted in Figure 5.6 show how journoi 
entries foster criticoi thinking at the higher ieveis of Bioom's Taxonomy. 



FIGURE 5.6 

Journal Prompts That Promote 
Critical Thinking Skiiis 



JOURNAL PROMPT 


LEVEL OF BLOOM’S 
TAXONOMY 


1. Write three fun riddles for your friends. 
First, describe a solid, but don’t tell your 
friend what it is. Give clues about the 
object. Start with its size and shape, how 
it feels, where you might find it, and its 
color, but remember not to name it. Then 
do the same for a liquid and a gas. 


Application 


2. Gravity is gone. Now everything is 
floating. Flow will this change the way 
you play during recess? Write a story 
about how you and your friends played 
without gravity. 


Analysis 


3. The sun is hiding, the crops are not 
growing, and everyone is going 
hungry. Write a fable that tells why 
the sun decided to hide and how he 
was convinced to come out again 
after seeing the effect he had on the 
germination, growth, and development 
of plants. 


Synthesis 


4. You have earned a lot of money, and 
you decide to use it to buy a pine forest 
so that those trees won’t be cut down to 
make toothpicks. Your friend thinks you 
are silly to spend your money protecting 
the forest. Write a letter to your friend and 
explain why it is so important to protect 
this habitat. Remember to describe how 
many animals depend on the forest for 
food and shelter. Be sure to also say how 
humans benefit from the pine forest. 


Evaluation 



Source of journal prompts: Whited, A. M. (Ed.). (2005). Nonfiction writing prompts for 
science. Lower eiementary. Englewood, CO: Advanced Learning Press. #/, p. 29; #2, p. 
39: #3, p. 57: #4, p. 53. 



Journais can aiso be used to assess attitudes toward science. 
Students con write their thoughts and feeiings about ciass events. This 
use of journais os an expressive outiet for students is best seen as o 
two-way communication. That is, if the teacher does not respond to, 
probe, chaiienge, or ask for eiaborations about the entries submitted, 
the fuii benefit of the journais wiii not be reoiized. Since students ore 
being asked to shore their own perceptions and opinions, there can 
be no wrong answers, and it is important that the teacher's responses 
reinforce this "no risk" environment. From such journais, teachers can 
gain vaiuabie insights into the diverse interests, abiiities, and attitudes 





of the students in their science cicsses. Soiiciting such information 
from students can aiso positiveiy affect student motivation to iearn. 

The way journais are graded depends on the purpose of the journai 
and the age of the students. The act of keeping a journai can be 
considered as an objective in itseif if the teacher beiieves that stu- 
dents need to structure, take charge of, or feei ownership in their own 
iearning. The criterion for success on this objective might be the com- 
pietion of the assigned journai entries or pages, not necessariiy the 
quaiity of the entries, in this scenario, rather than grading the content 
of the journai, students are awarded points in a grading period if they 

have compieted journai entries. 



APPLICATION 



Reflect on the units you teach through- 
out one school year. Write one journal 
prompt for each level of Bloom’s 
Taxonomy that you could use with 
some or all of these units. 

V 

purposeful, integrated coiiection of student work showing effort, 
progress, or a degree of proficiency. Physicaiiy, the portfoiio is a con- 
tainer of evidence of a student's achievements, competencies, or 
skiiis. it is purposefui in that the coiiection is meant to teii a story about 
achievement or growth in a particuiar area, if muitipie-choice and 
compietion items are at one end of the assessment continuum repre- 
senting very brief, quantitative, one-shot records of student achieve- 
ment, then portfoiios are at the other end, representing compiex, 
quaiitative, and progressive pictures of student accompiishments. 

Why use portfoiios? Portfoiios may best 
be considered as toois to promote 
communication between the student 
and an outside audience (the teacher, 
parents, prospective empioyers, 
etc.) about student understandings, 
strengths, weaknesses, progress, and 
seif-refiections. The use of portfoiios, iike 
any assessment method, starts with a 
consideration of these purposes. What 
are the objectives/standards that the 
portfoiio wiii heip the students achieve? 

Why are these objectives best assessed 
via a portfoiio? What is the portfoiio supposed to demonstrate? 
Severai different exampies of purposes for using portfoiios in science 
ciasses are iisted beiow: 

a. LEARNING TARGET: Ability to design an experiment 
USE OF PORTFOLIO: To show progress in this abiiity over the year by 
inciuding work on different assignments. Additionaiiy, if the objec- 
tive was to understand how students go about designing an exper- 
iment, the portfoiio couid contain aii activities, drafts, and revisions 
ieading up to the finai design. Students couid write reflections 
about their thinking at different stages in the design process. 



RESOURCES 



The following resource may help you in 
impiementing portfoiio assessment: 

Barton, J. & Coiiins, A. (Eds.). (1997). 
Portfolio assessment: A handbook 
for educators. Menio Pork, CA: 
Addison-Wesiey Pubiishing 
Company. 
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Portfolios 

Portfolios, like 
logs and jour- 
nals, contain 
collections of 
student work. 
A portfolio is 
defined as a 






b. LEARNING TARGET: Improving creative writing using science 
content knowiedge 

USE OF PORTFOLIO: To showcase the students' favorite/best pieces of 
creative writing. Such a portfoiio couid invoive parents in heiping 
students reflect on and choose their "best" pieces. 

c. LEARNING TARGET: Read, summarize, and evaiuate information in 
newspaper articies on science topics 

USE OF PORTFOLIO: The portfoiio may represent evidence of students' 
increasingiy sophisticated efforts at critiquing these articies. 

d. LEARNING TARGET: Show evidence of basic content knowiedge 

USE OF PORTFOLIO: Students couid assembie aii written tests into the 
portfoiio and write a reflection piece after each test on how they 
couid improve their performances. 

e. LEARNING TARGET: Identify ieorning strengths/weaknesses 

USE OF PORTFOLIO: Students couid assembie their coiiections based 
upon their own strengths and weaknesses. Progress on weaknesses 
couid be documented and reflected upon. 

There is no particuiar right or wrong way to impiement or use portfo- 
iios in the ciassroom. Rather, designing a portfoiio represents a series 
of decisions. Some of the design questions to be answered after the 
instructionai objective has been determined are iisted beiow (Butier 
& McMunn, 2006, in press): 

1. Is the purpose of the portfolio to instruct, to support learning, or to 
assess? Answering this question wiii heip determine the types 
of artifacts the students wiii coiiect. For instance, a portfoiio 
assembied soieiy for end of course grading purposes may 
contain oniy best-work pieces rather than a continuum of 
student work. 

2. What is the goal of using portfolios? For exampie, is the goai 
to promote seif-assessment? Student reflection on their own 
iearning? Probiem soiving? Particuiar skiiis? Fiigher-order 
thinking? The goai for the portfoiio wiii determine the design 
of the portfoiio. For exampie, to promote seif-esteem, a 
best works or memorabiiia portfoiio appears appropriate. 

Fiowever, if the goai is to improve students' proficiency 

at content-reiated skiiis, a skiiis portfoiio wouid be best. A 
portfoiio promoting student reflection wouid contain many 
subjective, journai-iike artifacts, whereas a probiem-soiving 
portfoiio wouid contain more objective work. 

3. What types of artifacts will be collected in the portfoiio? Wiii oniy 
written work be accepted, or wiii videotapes, posters, and 
computer disks aiso be acceptabie? Fiow many artifacts are 
necessary for documentation of a skiii, goai, or purpose? The 
decisions made here wiii impact the size of the portfoiio and 
its physicai characteristics and may be influenced by the 
storage capacity of the ciassroom ! if a flie foider or binder 

is used, then perhaps oniy written work can be accepted, 
if an eiectronic portfoiio is pianned, aii data may be stored 
on a disk or CD. The use of a singie quality entry to prove a 
skiii is recommended over the use of muitipie entries for that 
one skiii. (Sureiy if the student was successfui once, he can 
be so again!) The number of artifacts aiso defines the type of 



portfolio tool. If o// student work is collected in the portfolio, 
the purpose is lost, and the assessment tool is just a notebook, 
not a portfolio. 

4. How will artifacts be selected for the portfolio? Will the students 
select them, or will the teacher select them? How often 
will the portfolio be updated by adding artifacts? Must the 
students keep copies of all potential portfolio artifacts, or 
will the teacher maintain a file for this purpose? What are 
the criteria for selecting artifacts (how will the teacher or 
the students decide if a particular artifact documents a skill, 
purpose, or goal)? If the portfolio is intended to promote self- 
assessment, the students should choose the artifacts. Older 
students may keep their own working files, while younger 
ones may need help with this process since they have not yet 
developed organizational skills. 

5. How will students be oriented to the use of the portfolio? The 

recommended method is to start slowly, giving students 
plenty of support and practice. A structured portfolio, 
in which expectations are explained to students, is 
recommended over a more open, unstructured design. 
Remember that change is difficult; be prepared for some 
student resistance to this new procedure. Perseverance and 
consistency are two key factors in the success of portfolio 
implementation. 

6. How will the portfolio be assessed? A scoring guide, or rubric, is 
essential for this task, and this scoring guide should be shared 
with students before the assessment begins. However, if the 
work in the portfolio has already been assessed as individual 
pieces, should the overall portfolio also receive a grade? 

7. How will the information in the portfolio be housed? storage and 
handling of student information can be quite overwhelming, 
especially if the portfolio is one that travels with a student 
over an extended period. Many companies have developed 
software that helps manage the materials stored in a 
portfolio. Many of the student management systems used in 
districts also have a portfolio component housed within the 
management system for teacher and student use. 

8. What planning should be done before asking students to compile 
a portfolio? Constructing the portfolio scoring guide before 
assigning the work will prevent student frustration, enhance 
the matching of the purpose to the artifacts, and ease 
the assessment task for the teacher. It is much simpler to 
assess an assignment if the plan for assessing it is written 
beforehand. Through careful planning, the teacher does not 
have to dread the moment he must confront a huge mound 
of papers, wondering what he will find in the contents of the 
portfolios his students have constructed. 

Once these design questions are addressed, the portfolio can be 
planned and implemented. Like any of the other methods of evalu- 
ating student work, portfolios involve the development of criteria for 
judging good work. FIGURE 5.7 lists criteria appropriate for a portfolio 
designed to address the instructional goal: Students will read, sum- 
marize, and evaluate information in newspaper articles on science 
topics. 



FIGURE 5.7 

Appropriate Criteria for a Newspaper 
Articie Portfoiio 



r N 

Accurate summaries of at least eight 

newspaper articles 

• All selected articles address science topics. 

• All selected articles address DIFFERENT science topics. 

• All original evaluations of articles ore present. 

• Each original (previously assessed) evaluation 
Is followed by a student reflection on strengths/ 
weaknesses of the evaluation (explains why the grade 
was justified). 

• Ending reflection explains how weaknesses found 
In each original evaluation were addressed and 
summarizes changes that occurred over the course of 
the year In writing evaluations. 
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AI^UCATION 



Identify one instructional gool/leorning 
target/national or state standard that could be 
addressed by implementing portfolio assessment. 
Work through the portfolio design questions to 
plan such on implementation. For number 6, it is 
sufficient to simply list important criteria, rather 
than develop the entire scoring guide. 



Exhibitions/Projects 

Exhibitions and projects provide 
opportunities for students to 
perform "real-life" tasks or wres- 
tle with complex challenges. 
Such assessment types are usu- 
ally of longer duration than per- 
formance tasks. Exhibitions and 
projects may run throughout o 
six-week grading period, or In 
the case of some culminating projects, extend to a year-long period. 
Exhibitions and projects have multiple steps (e.g. planning, research- 
ing, designing. Implementing, etc.) and multiple criteria ore needed 
to judge them. Students may be asked to structure on approach to 
a problem. Investigate alternatives, produce o response, and justify 
approaches taken. More often than not, the tasks are assigned to 
teams of students, os that Is how many "real-world" problems are 
tackled. Through such complex experiences, however, students 
develop Into cooperative team members, problem-solvers, effective 
thinkers, quality producers, and self-directed learners. 



Exhibitions and projects, like all performance-based assessments 
should be designed and selected to teach core curriculum content 
standards and should be scored using a rubric which was shared with 
students "up-front." Students can be given some choice as to the 
activities they will perform or the roles they will assume within the proj- 
ect. In addition, students should be required to meet Interim dead- 
lines for the project (which will old the procrastinating student), to 
participate In planning the project (old for the disorganized student), 
and to reflect on project activities (old for the "surface" learner). 





Of course, projects at different grade levels will vary in level of dif- 
fioulty. The following examples may help in planning projects and 
exhibitions: 

Elementary Level 

• Students study the systems of the body and make life-size 
posters showing the looation of major body organs. 

• Students plan and design an appropriate baokyard play 
area for a pet. 

• Fourth-grade students run the sohool weather station, 
devising the weather instruments, using them to colleot data, 
and making prediotions about the weather, whioh are then 
reported during the morning announoements over the Public 
Address system at the school (National Researoh Counoil, 

1999 ). 

Middle School Level 

• Students design and build model racecars to test the effeot 
of tire sizes, gear ratios, and body design. 

• Students ohoose a topio, plan, write, and produoe a skit 
based on a soientifio oonoept or prinoiple. 

• Teams of students oompete to construot a framework that will 
support a full cup of water. The lightest framework using only 
allowed materials will win. 

High School Level 

• Science students reclaim an endangered estuary through 
clean up efforts and then turn the estuary into a "living 
classroom" for elementary students. 

• Teams research one inherited human disorder and report 
to the whole class on mode of inheritance, symptoms, 
frequency of occurrence in the general population as well as 
specific populations, care needed for those suffering with the 
disorder, and effect on society (National Research Council, 

1999 ). 

• Students compete in science competitions in which they 
design and perform experiments to answer a research 
question. 

• Students take on the roles of health care and support 
personnel in a hospital faced with a decision on whether to 
operate to separate conjoined twins. 

• As a graduation requirement, individual students must 
perform research, write a research paper, and present their 
findings to an outside audience in order to complete their 
senior projects. 

Projects and exhibitions, like the examples listed above, often serve 
two purposes, instruction and assessment. Just being involved in a 
project will require that students learn new knowledge and new skills. 
The grading criteria, shared with the students before they begin their 
work, can also be very instructional, in that such criteria spell out for 
students what quality work will entail. The teacher is using the project 
or exhibition, however, to find out what students know or are able to 
do, which is certainly an assessment function. 



Projects and exhibitions are often ideaiiy suited to the science ciass- 
room, as they require students to "do science," not simpiy read about 
it. impiementing such performance-based assessments in the ciass- 
room can be time consuming and chaiienging. The foiiowing sugges- 
tions, adapted from Davey and Rindone (1990), may assist teachers 
in pianning: 

1 . Start with an issue, idea, scenario, or probiem and test it by 
asking how important it is; how engaging it wouid be to stu- 
dents; how reievant it is to "reai-iife"; and what content areas 
couid be iearned within the content of the project/exhibition. 

Ask, "Does this aiign with my curricuium?" Considering the 
time investment, ensure that the project wiii provide rich data 
about mastery of a number of standards, if possibie. 

2. Begin to define the task more fuiiy by asking what knowi- 
edge, competencies, skiiis, or dispositions students wiii have 
to use to compiete the project or exhibition. This wiii focus 
attention on the project outcomes and on instructionai 
objectives/iearning targets. Revise and eiaborate on the proj- 
ect untii the iearning targets aiign with the task that students 
are asked to perform. 

3. Consider the context of the project/exhibition. What is the 
most appropriate medium for students to use (orai presenta- 
tion, written product, computer simuiation, a debate, a town 
meeting, etc.)? Shouid the task be done individuaiiy or in 
groups? Shouid experts from the community be accessed? 

4. Consider the administration of the project/exhibition. What 
do students need to know before the work begins? What dif- 
ficuities might be encountered? How wiii students receive 
assistance? 

5. Consider how students' work on the task wiii be assessed. Wiii 
there be a checkiist for work processes to guide students in 
the process of compieting the task? What are the important 
features of a successfui product? Who might assess student 
performance other than the teacher (peers, community pro- 
fessionais, other schooi staff, etc.)? 

6. Taik over the proposed project/exhibition with coiieagues 
and with students. Ask them to review the pian as weii as 
the criteria that wiii be used to judge student work. Revise as 
needed, once the reviews are in. 



APPMCATION 



1 . Choose a project that you hove used in the post with your students. Refiect 
on the strengths and weaknesses of this project. Revise os needed. 

2. Pion a new project for your science cioss. Use the six suggestions above to 
heip you in this process. 








One Last Look at Performance-Based 
Assessment 

In the last three chapters, we have examined many types of perfor- 
mance-based assessments. All hove been clustered, however. Into 
three main categories: 

• Observing students 

• Soliciting Information from students 

• Evaluating student work 

The performance-based assessments that toll Into these categories 
all go beyond multiple-choice testing, and they hove several quali- 
ties In common. They: 

• Promote doing science. 

• Stimulate higher-order thinking skills that ask students to 
do more than simply recall basic facts. Instead, these 
assessments ask students to apply, analyze, synthesize or 
evaluate. 

• Present "real-life" challenges and promote the learning of 
"real-life" skills. 

• Encourage self-assessment and self-reflection. 

• Promote the development of self-directed learners, as 
many performance assessments provide students with the 
autonomy needed to evaluate their work. 

• Provide more valuable Insights Into student thinking and 
student learning for teachers than do answers on multiple- 
choice tests. 

• Allow Instruction and assessment to overlap, as students 
learn to become quality producers while they ore learning 
essential science content. 

In order for performance-based assessments to be successful and 
effective, teachers must devote much "up-front" time to plan- 
ning. These assessments must be carefully constructed to help 
students achieve particular standards, objectives, learning targets 
or expected outcomes of Instruction. They must be aligned to the 
actual Instruction that occurs on a day-to-day basis In the class- 
room. Finally, students must clearly understand the directions for the 
performance-based assessment, be provided time to practice such 
assessments, and be given grading schemes for the assessment 
before beginning work. 

The next chapter In this publication, then, focuses on providing stu- 
dents with clear descriptions of teacher expectations, giving them 
timely and meaningful feedback on their performances, and reporting 
assessment results. To this end, CHAPTER 6 covers RUBRICS AND GRADING. 



CHAPTEI^ SIX 




RUBRIC'S AND Grading 



In this manual, we have reiterated several times that performance- 
based assessments are more time consuming and complex to create 
than simple, fact-based multiple-choice tests. However, we have 
encouraged teachers to implement these types of assessments 
because of the many benefits that result from such implementation. 
(See the list at the end of CHAPTER 5.) Such benefits far outweigh the 
difficulties involved in planning and implementing performance- 
based assessments. We must clearly articulate the difficulties so 
that teachers con a) be prepared for these and not ambushed or 
surprised by them, b) set realistic time schedules for planning and 
implementing performance-based assessments (allowing increased 
amounts of "up-front" planning time), and c) devise strategies to help 
allay difficulties. 

In this chapter, we address one more difficulty: grading perfor- 
mance-based assessments. Like planning and implementing these 
types of assessments, grading provides many challenges that teach- 
ers using only multiple-choice tests will not encounter. Grading per- 
formance-based assessments will never be as easy os running the 
sheets through the Scan-tron machine, but there are methods and 
strategies teachers can use to make this process more manageable. 

Reflect for a moment on an assignment you dreaded to grade. Then, 
read the scenario described in FIGURE 6.1. 

FIGURE 6.1 

The Science Fair Scenario 

r \ 

An interview with Micah, a middie schooi teacher, reveaied the 
foiiowing scenario: 

I once taught in a middle school where every 8th- 
grode student was required to complete a science 
fair project and enter it in the school science fair. A 
great deal of time in science class was, naturally, 
devoted to this project, particularly in the third grad- 
ing period as the end of this period coincided with the 
dote of the school science fair. On the day that the 
projects were due, parents and students descended 
on my classroom, bringing backboards and science 
equipment to display. By the end of the day, I hod at 
least 160 backboards stacked in the back of my class. 

It was then that it really hit me— I had to grade these 
projects before the school science fair. I had about 3 
days to do this. What would I do? 

I began by setting up the backboards from three of 
my best students. By looking at their work, I started to 
make a list of criteria I could use to grade the proj- 
ects. I assigned points to the criteria and then began 
to grade other backboards. 
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Unfortunately, in the Micah scenario, we find that insufficient plan- 
ning time was devoted to designing the project. We recommend 
that teachers select the criteria to be used before assigning the proj- 
ect. In this manner, teachers can be assured that the grading criteria 
actually align with both the instruction and the instructional goals, 
and students can be informed of teacher expectations before they 
begin to work. 



The technique itself is not a bad one: It is easier to create grading 
criteria by looking at student work. Teachers generally find it helpful 
to look at high-quality and low-quality student work in order to get a 
complete picture of the range of work possible. It would have been 
more helpful, however, if Micah had accessed previous student work 
(such as those projects from last year's science fair). That way the 
assessment design could occur before this year's students began 
to work, and the design could be shared with students up-front, not 
unfairly used to grade them after they completed their work. 



Teachers always use a set of criteria to grade 
student work, even if they fail to articulate the 
criteria to themselves or to students. Grading 
schemes help make these hidden criteria vis- 
ible to students. Once students clearly under- 
stand the expectations, they can more easily 
work toward achieving these expectations. 
Therefore, this chapter is devoted to making 
grading criteria visible and accessible. Several 
different methods are available for informing 
students of grading criteria before they begin 
work. These include point systems, checklists, 
and rubrics. 

Point Systems 

A point system assigns points for certain fea- 
tures of the student's response. Open-ended 
questions are often scored with this approach 
because points can reflect partial as well as 
full credit for a response. 



APPMCAT10N 



Discuss the Micah scenario with a 
coiieogue. Use the foilowing to guide 
this discussion: 

• Describe o time when you felt 
overwhelmed about grading. 

What did you do to get through 
this experience? 

• How do you feel about Micoh's 
strategy of looking at work from 
his three best students to 
determine criteria to use in 
grading other's work? Was this o 
good or bod idea? 

• How fairly or unfairly do you think 
Micoh's students were grading on 
the science fair project? Why? 
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For example, if third-grade students are given 

the appropriate equipment and asked to find 

out if stirring makes any difference in how fast sugar cubes and loose 

sugar dissolve (NAEP, 1986), the point system may be similar to the 

one shown in FIGURE 6.2. Here, the points are used to score students' 

oral responses to the question: Did stirring make a difference in how 

fast the two types of sugar dissolved? 




FIGURE 6.2 

Scoring the Sugar Question 



POINTS 

AWARDED 


DESCRIPTION OF RESPONSE 


4 


If the response states that both types of sugar dissolve faster when stirred, 
but loose sugar still dissolves faster than cubes 


3 


If the response indicates that stirring makes a difference but doesn’t 
describe the fact that loose sugar dissolves faster than cubes 


2 


If the response describes the relative speed (loose dissolves faster) but not 
the effects of stirring OR if the response just describes what happens (e.g., 
stirring makes the cubes come apart) 


1 


Incorrect response 


0 


No response 



Point systems are useful, then, in scoring responses where partial 
credit may be given. They are most helpful in scoring short answer 
open-ended questions, however, rather than in grading essay ques- 
tions. Extended response essay questions entail more complex, 
detailed answers and therefore engender the need for a more com- 
plex grading system (see the Rubrics section below). 

Checklists 

Checklists, like point systems, are often used when complex responses 
are not expected from students. Checklists, however, are more likely 
than point scales to be used in judging student actions or behaviors. 
For example, a checklist can be used to indicate that a student has 
effectively completed the steps involved in a task or demonstration. 
FIGURE 6.3 displays a checklist that could be used when evaluating 
student knowledge of the parts of the microscope and when evalu- 
ating students on the proper operation of the microscope. 



FIGURE 6.3 

Microscope Checklist 



CORRECTLY IDENTIFIES: 


PERFORMS THE FOLLOWING OPERATIONS CORRECTLY: 


□ Stage 


□ Swings low power objective into place 


□ Stage clips 


□ Places slide on stage and secures with clips 


□ High power objective 


□ Uses coarse adjustment to move low power objective 


□ Eye piece 


as far down as possible 


□ Coarse adjustment knob 


□ Looks through eyepiece 


□ Fine adjustment knob 


□ Uses coarse adjustment knob to raise low power 
objective until object is in focus 




□ Uses fine adjustment knob to bring object into focus 

□ Swings high power objective into place 

□ Uses fine adjustment knob to bring object into focus 





As the example in FIGURE 6.3 shows, checklists con be useful in evalu- 
ating simple student actions, particularly ones where there are a 
limited number of options. Examples include noting the presence or 
absence of certain actions (secures slide with clips), the sequence 
of actions (uses coarse adjustment knob to move low power objec- 
tive down before attempting to focus with fine adjustment knob), or 
whether o student has given a correct or incorrect answer (correctly 
identifies microscope port). 

Checklists ore also effective in getting students to check their own 
work. For example, prior to taking up notebooks, o teacher may pro- 
vide o checklist to students listing oil the assignments that should be 
included. Students con use the checklist to evaluate the complete- 
ness of the notebook before handing it in. 

Rubrics 

So for, we hove examined grading schemes that con be used in scor- 
ing relatively simple performance-based assessment, os short answer 
questions or simple student actions with limited options. Many per- 
formance-based assessments coll for long or highly complex student 
responses. For these types of responses, o rubric is useful because o 
rubric con take into account many different criteria forjudging stu- 
dent work. Rubrics con help students begin to understand that there 
are levels of quality to their work and to their thinking. Rubrics can 
aid students in learning that high-quality work is important. Too many 
students just turn in work to get it done, get the check for comple- 
tion, and don't worry about crafting a high-quality response or prod- 
uct. Taking their work lightly can come back and hurt them down 
the road, in college or in their chosen professions. Rubrics, because 
they demonstrate the various levels of proficiency and because they 
define high-quality work, con help students to croft the desired high- 
quality products and responses. 

There are two main types of rubrics: analytical and holistic rubrics. 

This section will define and give examples of each type before clos- 
ing with a discussion of the advantages and disadvantages of each. 



Analytical Rubrics 

The most common format of on analytical rubric consists of o list of 
criteria down one side with proficiency levels listed and described 
across the page. FIGURE 6.4 provides o simplistic example of this for- 
mat for on analytical rubric used to score o physics problem-solving 
task. In this particular example, descriptions of the proficiency levels 
(Exceeds goals. Meets goals. Approaches goals. Goals not yet met) 
are not present. 



FIGURE 6.4 

Physics Problem-Solving Rating Scale 





EXCEEDS 


MEETS 


APPROACHES 


GOALS NOT 


CRITERIA 


GOALS 


GOALS 


GOALS 


YET MET 



1. Correctly states the problem 
and Identifies the Information 
needed to solve It and the 
steps needed to arrive at a 
solution. 

2. Produces reasonable 
estimates of data values that 
were not identified by the 
teacher but needed for the 
solution to the problem. 

3. Applies concepts and 
formulas related to motion 
(velocity, acceleration, 
average speed). 

4. Makes accurate conversions 
as needed to solve the 
problem. 

5. Communicates conclusions 
clearly, using examples as 
needed. 



Source: Adapted from Davey & Rindone (1990), "Anatomy of a Performance Task." 
Presenfed at the American Educational Research Association meeting, Boston, MA, 
from materials developed by the Bureau of Evaluation and Student Assessment 
Connecticut State Department of Education. 

As this rating scaie shows, students hove the opportunity to receive 
scores in severai different dimensions (i.e., they wiii get o score for 
each criterion iisted). Anaiyticoi rubrics can heip both teachers and 
students diagnose strengths and weaknesses in a performance. This 
enobies students to target the areas of their performances that need 
to be improved. 

A rating scaie iike the one dispioyed in FIGURE 6.4, however, may 
moke too many assumptions about ciority to be usefui to students, in 
other words, since no descriptions of the proficiency ieveis are pres- 
ent, students may have difficuity in understanding teacher expecta- 
tions. FIGURE 6.5 displays o rubric that does provide descriptions at 
each level of proficiency. 








FIGURE 6.5 

Poster Displaying Science Principle 
Analytical Rubric 



CRITERIA 


EXCELLENT 
(4 POINTS) 


ACCEPTABLE 
(3 POINTS) 


RE-DO 
(2 POINTS) 


SCIENTIFIC 

ACCURACY 


Scientific principie 
is accurateiy and 
compieteiy stated and 
supported by weii- 
expiained, accurate 
exampies from reai iife. 


Scientific principie is 
accurateiy stated and 
supported by accurate 
exampies from reai iife. 
Exampies are not as ciear 
as they coud be. 


Scientific principie is 
inaccurateiy stated OR no 
exampies are present. 


ORGANIZATION 


information is organized 
iogicaiiy, in that a theme 
(or themes) is easiiy 
seen (e.g., chronoiogi- 
cai theme, ciassification 
theme, etc.) 


information is 
somewhat organized 
but organization is not 
consistent throughout. 


No theme is present. No 
organization is apparent. 


GRAPHICS 


Poster contains at ieast 3 
graphics. Graphics sup- 
port the understanding of 
the principie by both pro- 
viding reai-iife exampies 
and giving schematics 
expiaining how the prin- 
cipie functions. 


Poster contains graphics, 
but graphics don’t 
consistentiy show 
understanding of the 
principie (weak exampie 
or unciear schematics). 


Poster contains no 
graphics or graphics 
that don’t reiate to the 
principie. 



APPMCATION 



Compare and contrast the scoring examples shown in 
FIGURES 6.4 and 6.5. 

• Which approach would be more useful to students in 
planning and implementing their 

pe rfor ma n ces/ prod u cts? 

• Which would be more useful in encouraging student 
assessment? 

• Which gives the clearer picture of teacher 
expectations? 

V 



Holistic Rubrics 

Rather than assigning separate scores for each important aspect of 
a performance, hoiistic rubrics consider aii the criteria simuitaneousiy 
and resuit in a singie summary rating or grade. This type of rubric may 
be more appropriate when the purpose is to provide students with an 
overaii index of their performance on a task or product. Hoiistic rubrics 
are aiso used more often in cuiminating assessments, after students 
have aiready received feedback on their progress during the iearning 
process. Figure 6.6 shows an exampie of a hoiistic rubric for the some 
performance task used in FIGURE 6.5 (creating o poster dispiaying o 
scientific principie). 









FIGURE 6.6 

Poster Displaying Science Principie Hoiistic Rubric 



OVERALL 

SCORE 


DESCRIPTION OF PERFORMANCE 


4 


Scientific principie is accurateiy and compieteiy stated. Student gives weii- 
expiained and accurate exampies from reai iife. Poster is organized iogicaiiy, 
in an easiiy discernabie theme. Poster contains at ieast three graphics which 
iiiustrate both reai-iife exampies of the principie and provide schematics to 
expiain how the principie functions. 


3 


Scientific principie is accurateiy stated. Student gives severai accurate 
exampies from reai iife but they couid be expiained mare cieariy. Poster is 
organized, but organization or theme is inconsistent in pieces. Student can 
expiain the organizationai theme, however. Poster contains severai graphics 
that both iiiustrate reai-iife exampies and provide schematics. 


2 


Scientific principie is stated correctiy, but may be somewhat incompiete. 
Student gives at ieast two accurate exampies from reai iife but the 
expianations are somewhat unciear. Poster is partiaiiy organized, but no 
discernabie theme is present. Poster contains graphics, but the graphics are 
weak exampies. 


1 


Scientific principie is inaccurateiy stated or fewer than two accurate exampies 
are given. No organizationai theme is present. Poster contains no effective 
graphics. 



APPLICATION 



Compare the analytical rubric displayed in FIGURE 6.5 with 

the holistic rubric shown in FIGURE 6.6. 

As a teacher, consider: 

• What are some advantages and disadvantages to 
creating and using analytical rubrics? 

• What are some advantages and disadvantages to 
creating and using holistic rubrics? 

Ask students to consider: 

• What are some advantages and disadvantages in 
receiving and being assessed using analytical rubrics? 

• What are some advantages and disadvantages in 
receiving and being assessed using holistic rubrics? 










Advantages and Disadvantages of the Two 
Types of Rubrics 

One distinct advantage of anaiyticai rubrics is that this type of 
rubric gives more meoningfui feedback to students. Since students 
receive scores for each criterion, it is easy for students to discern 
their strengths and weaknesses. For the teacher, however, anaiyticai 
rubrics can be a iittie more time consuming to use, as each student 
must receive a score in every dimension on the rubric. Because of the 
potentiai for more defined feedback, however, anaiyticai rubrics are 
very usefui to use during the iearning process. They provide structure 
and support for iearning and heip students cieariy understand the 
teacher's expectations. They are very heipfui in promoting student 
seif-refiection and seif-assessment, as students can check their own 
work against the descriptions of high-quaiity work provided on the 
rubric. 

Hoiistic rubrics are usuaiiy easier for teachers to use in giving grades. 
However, because oii the characteristics of the work are iumped 
together to create o singie score, it is harder for students to under- 
stand their strengths and weaknesses. 

Final Thoughts About Grading Schemes 

As o teacher begins to impiement performance-based assessments, 
it is important to examine aiternative ways of grading assignments. 
Muitipie-choice items can be scored very objectiveiy. The student is 
offered o fixed number of options, and the option seiected is com- 
pared to a scoring key (containing the "right" answers). Given the 
scoring key, any teacher wouid score the muitipie-choice items in the 
some way. Performance-based assessments such os open-ended 
questions, journois, portfoiios, performance tasks, and exhibitions and 
projects often have no one right answer. Therefore, a different way 
to score these items must be deveioped. A iist of criteria is needed, 
oiong with descriptions of proficiency ieveis. Deveioping point sys- 
tems, checkiists, or rubrics heip define quoiity work for the students 
and heip the teacher score assignments from different students using 
the same criteria. Such grading schemes therefore enhance the 
objectivity of the teacher in judging student work. 

The foiiowing guideiines may heip you in deciding when to use a 
porticuiar grading scheme and in creating high-quaiity grading 
schemes that con enhance student performance: 

• Examine the task and choose the most appropriate format 
for the grading scheme, if a short answer is needed, o point 
system couid be the best choice. For iimited option tasks 
(the student either did o porticuiar action or did not do o 
porticuiar action), checkiists may be best. For highiy compiex 
or extensive responses/behaviors, a rubric may be needed. 

• Make a iist of criteria to use in scoring student work by 
examining work from post years. Look at both high-quaiity 
and iow-quoiity work to estabiish o range. Think about the 
weaknesses that occurred most often in post work and reflect 
on how your new grading scheme might heip prevent this 
weakness from re-occurring. 



Provide scored examples of student work to your present 
students when introducing a new grading scheme. Looking 
at such work and seeing the scores it received can help 
students understand teacher expectations. 

Support students while they are learning by using the type 
of grading scheme that will give them the most meaningful 
feedback. 

For new tasks, try involving the students in helping you create 
the grading scheme. 

Always distribute the grading scheme to students before they 
begin to work. 

Encourage students to self-assess, using the grading scheme 
provided, before handing in work. 



Compiling Grades and Communicating 
Student Achievement 



The following books may help teachers 
reflect upon their current grading 
practices: 

Brookhort, S.M. (2004). Grading. Upper 
Saddle River, NJ: Pearson Merrill 
Prentice Hall. 

Marzano, R.J. (2000). Transforming 
ciassroom grading. Alexandria, VA: 
Association for Supervision and 
Curriculum Development. 

O'Connor, K. (2002). How to grade for 
iearning: Linking grades to 
standards. (2nd Ed). Arlington 
Heights, IL: Skylight Training and 
Publishing, Inc. 



Once all the multiple-choice and performance assessments are 
done, teachers compile individual grades into one overall grade 
for a marking period. The way that this compilation is done is often 

explained in a grading policy state- 
ment, usually distributed at the begin- 
ning of the school term. Overall grades 
then appear on report cards, which are 
sent home to parents. 



RESOURCES 



For parents and students to understand 
how a student is progressing academically, 
they must be able to accurately interpret 
the overall grade shown on the report card. 
Brookhart (2004, p.7) states that "grades 
and other communication about student 
achievement should be based on solid, 
high-quality evidence. Teachers should be 
able to describe that evidence and explain 
how they arrived at any judgments about 
the quality of student work." The grading 
schemes discussed previously in this chap- 
ter will help teachers articulate the types of 
evidences they have collected and explain 
how student work was judged. 



Another grading dilemma to consider is 
how mony grades are needed to constitute the "solid, high-quality 
evidence" that Brookhart recommends. There is no magic number 
of grades; the teacher must be the judge of the amount of grades 
that is sufficient. However, using only one measure (e.g., a one-hour, 
paper-and-pencil exam) to determine a report card grade is clearly 
insufficient evidence. At the other extreme, assessing student per- 
formance daily would not provide students with the time needed to 
develop competences and skills preparatory to being assessed. 



When implementing performance-based assessments, teachers 
may find difficulties in using point systems, checklists, and rubrics, as 
these often contain scores that must be converted to the standard 
grading scale used by the school. Students are more used to getting 





letter grade (A, B, C, D, F) or percentage scores (94, for example). 
Therefore, It Is Important to Include a grade conversion chart with any 
grading scheme. Converted scores must be compiled In some way 
In order to formulate on overall report cord grade. The procedure for 
formulating final grades Is often spelled out In o grading policy state- 
ment. Such grading policies often Include: 

• How missing work will be counted 

• The weight of particular assignments (os tests may count 
more than homework) 

• The grading scale used by the school (percentage points 
needed to get an "A," etc.) 

For on example of the use of o weighting scheme, see the weighting 
system shown In FIGURE 6.7. In this sample, demonstrating knowledge 
on tests represents 37% (100/270 points) of the total grade; science 
process skills, maintaining o journal, and completion of on extended 
group project each represent 19% (50/270 points); and creative writ- 
ing represents 6% (20/270 points). The proportion of the total grade 
accounted for by Individual assessments should communicate the 
relative Importance of different desired outcomes (that Is, more 
Important outcomes carry more weight). 



APPLICATION 



Revise or create a grading poiicy statement for your classes. 
Consider increasing the amount of performance-based assessments 
you will implement this year. 

How might this change your current grading policy? 

How will weights of assignments change? 

How can you use the grading policy to ensure that students actually 
do science? 








FIGURE 6.7 

Sample Grading Period Weighting System 



student A 



DESCRIPTIONS 

OF 

ASSIGNMENTS 


MAXIMUM 

POINTS 

AVAILABLE 


POINTS 

EARNED 


WEIGHT OF 
ASSIGNMENT 


STUDENT A 
SCORE/MAXIMUM 
SCORE 


Paper-and-pencil 
test on electricity 


50 


40 


1 


40/50 


Performance test 
on electricity 
(making circuits) 


25 


20 


2 


40/50 


Weekly lab 
assignments on 
science process 
skills 

(5 assignments X 
10 pts each) 


50 


45 


1 


45/50 


Two creative writing 
tasks (10 pts each) 


20 


20 


1 


20/20 


Journal 


50 


50 


1 


50/50 


Extended group 
project 


50 


45 


1 


45/50 


Totals 


245 


220 




240/270 



The weighting system used in deriving report card grades shouid be 
reiated to the course objectives and expiained to students so that 
they know the goais. in the exampie shown in FIGURE 6.7, students 
might be informed at the beginning of the grading period of the 
instructionai objectives to be taught and the assessments to be 
used. The number of points needed for the different grade symbois 
(number of points to get an A; number of points to get a B, etc.) 
couid aiso be communicated. 

in such a weighting system, it is aiso important to stay fiexibie so as 
not to penaiize students for poor quaiity assessments. For exampie, if 
students were toid that a certain number of points constituted an A, 
but no students earned this many points due to a pooriy constructed 
test, some adjustment to the point system wouid have to be made. 

Student achievement status on important instructionai objectives 
can be communicated in ways other than a singie report card 
grade in science. Some teachers find that grades, aithough required 
by poiicy, are not particuiariy heipfui in conferencing with students 
and parents about students' performance on specific iearning 
goais. The actuai grading schemes (e.g., checkiists, rubrics) as weii 
as anecdotai notes or observation instruments can be used in addi- 
tion to grades or as aiternative means of reporting to parents. 








& EXTW & Started 



Traditional practices in assessment are based on beliefs about the 
purpose of education that are currently being publicly discussed and 
challenged. Performance-based assessments described in this man- 
ual meet some of the identified challenges related to developing 
students' higher order thinking skills. Assessment practices of teachers 
do not necessarily change once people become aware of the need 
for change. Change does not happen the day after an afternoon of 
inservice training. Generally, change is a slowly evolving process that 
occurs through experience, dialogue, and reflection. 

Teachers need time to try new assessments, time to reflect on the 
success or failure of these new methods, and time to make revi- 
sions. Just as student learning is an individual process that is person- 
ally constructed, so is teacher learning about assessment practices. 
Changing assessment practices is not a simple, linear, lock-step 
process that all teachers follow in a prescribed manner. Rather, it is a 
process of becoming more purposeful about: 

• Desired student outcomes in science 

• The design of learning experiences in support of these 
outcomes 

• The use of assessment methods that match well with desired 
outcomes 

• The use of grading systems that reflect student achievement 
on these outcomes 




What are some contexts in which this more purposeful thinking about 
student assessment might be developed? 

Some districts hove initiated district-wide staff development efforts 
in assessment. The literature on professional learning suggests that o 
good staff development program is sustained over time. Teachers ore 
more likely to change in o collegial setting with sustained administra- 
tive support (Loucks-Horsley, Brooks, Carlson, Kuerbis, Marsh, Padilla, 
Pratt, & Smith, 1990). 

This kind of model might involve bringing together o volunteer group 
of lead science teachers from several schools who, with o facilitator: 

• Spend o day on on overview of assessment (outcomes, 
methods, rubrics) os provided in this publication. 

• Spend o day reflecting on science education goals and 
beginning to develop or adopt assessments to try out 
(e.g., observation forms, interview protocols, open-ended 
questions, performance tests, journal criteria, portfolio tasks, 
exhibition and projects). 

• Come together os o group on o regular basis to shore 
experiences, demonstrate the assessments developed and 
the student results obtained, continue to develop or find new 
assessments, and identify areas in which further assistance or 
information is needed. 

The following year, the lead teachers could start o similar process for 
interested science teachers within their own schools. 




Teachers, either individually or in informal groups could begin to 
reflect on their assessment practices. Incorporating performance- 
based assessment into the classroom may be easier if experiences, 
concerns, and frustrations are shared with colleagues. Sharing suc- 
cessful tasks and methods with other teachers also increases the 
number of assessments available. 

There is no right place to start with assessment. There are many activi- 
ties, depending on the prior experience, time constraints, interest, 
and resources of the teachers involved, which represent jumping-off 
points for changing or at least reflecting on assessment practices. 



APPLICATION 



Listed below are some examples of activities that might get conversa- 
tions started about assessment practices: 

• Articulate one very important desired student outcome (refer to 
CHAPTER 2). For example, a teacher might be interested in how well 
students can develop and test hypotheses in the content area 
under study. Review the assessment methods described in 
CHAPTERS 3-5 and choose an approach to assessing students' 
competence on this dimension that has not been tried before. Try 
the assessment approach to see what can be learned about 
student performance and about the assessment method chosen. 

• Experiment with a format for a course syllabus that outlines for 
students the major goals you have for their performance and how 
their performances on these goals will be assessed and report 
card grades will be derived. 

• Start a list of advantages and disadvantages of each of the 
assessment methods described in CHAPTERS 3-5. What do you feel 
you need to know from someone who has tried each method 
before you go any further? 

• Develop a chart (os the one in FIGURE 6.7) showing how you 
combine assessment data in obtaining student report cord 
grades. What kind of weighting system ore you using? 

• Analyze the tests you hove used in the post. Try to improve the 
items used on these test otter referring to the information provide 
in this manual about open-ended questions or performance tasks. 
Consider how you might improve or moke more explicit the 
rubrics for the items. 

• Start a folder of assessment samples from released state tests or 
item bonks, other teachers, district tests, or published articles. 
Critique them for your purposes. Save samples of student work that 
demonstrate different levels of proficiency. Discuss these samples 
with other teachers to gain a shared view of mastery. 

• Review the hands-on, experiential, or lob activities you use with 
your students. Identify the most essential ones, and experiment 
with rubrics that could be used to assess student performance on 
these tasks. 








The process of incorporating and using a broader array of assess- 
ment methods can sharpen teachers' thinking about the meaning of 
student success in science, it can aiso resuit in improvements in the 
quaiity of instruction teachers design for students. Finaiiy, if teachers 
are expiicit and purposefui about their goais, students are more iikeiy 
to evaiuate the quaiity of their own work. 

The benefits of experimenting with a variety of assessment methods 
iie as much in the conversations they engender between teachers 
and students and among teachers as they do in the information they 
provide on student competence. Students as weii as teachers often 
become empowered as assessment becomes a dynamic, interac- 
tive conversation about progress through the use of interviews, jour- 
nais, exhibitions, and portfoiios. Through these assessment methods, 
teachers reiate to students more as faciiitators, coaches, or critics 
rather than as authority figures that dispense aii information and 
knowiedge. 
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The SERVE Center at UNCG, under the leadership of Dr. Ludwig David 
van Broekhuizen, is an education organization with the mission to 
promote and support the continuous improvement of educational 
opportunities for oil learners in the Southeast. The organization's 
commitment to continuous improvement is manifest in on applied 
reseorch-to-proctice model that drives oil of its work. Building on 
research, professional wisdom, and craft knowledge, SERVE staff 
members develop tools, processes, and interventions designed to 
assist practitioners and policymakers with their work. SERVE's ulti- 
mate goal is to raise the level of student achievement in the region. 
Evaluation of the impact of these activities combined with input 
from stakeholders expands SERVE's knowledge base and informs 
future research. 

This rigorous and practical approach to research and development 
is supported by on experienced staff strategically located through- 
out the region. This staff is highly skilled in providing needs assess- 
ment services, conducting applied research in schools, and devel- 
oping processes, products, and programs that support educational 
improvement and increase student achievement. In the lost three 
years, in addition to its basic research and development work with 
over 170 southeastern schools, SERVE staff provided technical assis- 
tance and training to more than 18,000 teachers and administrators 
across the region. 

The SERVE Center is governed by o board of directors that includes 
the governors, chief state school officers, educators, legislators, and 
private sector leaders from Alabama, Florida, Georgia, Mississippi, 
North Carolina, and South Carolina. 

SERVE's operational core is the Regional Educational Laboratory. 
Funded by the U.S. Department of Education's Institute of Education 
Sciences, the Regional Educational Laboratory for the Southeast 
is one often Laboratories providing research-based information 
and services to all 50 states and territories. These Laboratories form 
o nationwide education knowledge network, building a bonk of 
information and resources shared and disseminated nationally 
and regionally to improve student achievement. SERVE's National 
Leadership Area, Expanded Learning Opportunities, focuses on 
improving student outcomes through the use of exemplary pre-K and 
extended-day programs. 
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